A Russian renaissance?


Vladimir Putin’s promise to increase research spending is welcome — but his country’s scientific
system needs a complete overhaul.


The plight of Russian science was far from the minds of most of the
protesters filling Moscow’s streets after the disputed parliamentary
poll in December and the equally contested presidential elections on
4 March. The protests, the largest in the country for 20 years, reflect
the alienation of Russia's intelligentsia and rising urban middle-class
from its inbred political leadership, and the autocratic tendencies
scarcely veiled behind its assertions of democracy. Yet the crowds’
grievances do mirror some of the problems that are keeping Russian
science down.

There is no doubt that the scientific system seriously lacks money.
Vladimir Putin’s pre-election promise to pump funding into basic
science and innovation (see page 253) as part of his plan to modernize
and diversify Russia's stagnating economy, which is heavily dependent
on energy exports, is welcome. His spending promises — and his
opponents’ general lack of interest in science — may explain why many
Russian academics and working scientists voted to return Putin to the
presidency.

But Russian science is undergoing a crisis that runs deeper than
money: the country’s publication output has declined in the past decade
despite a slight recovery of the still-meagre public science budgets.
More than 20 years after the end of the Soviet Union, Russia's large
research community is fundamentally divided over how it would like
the domestic science funding system to operate.

Russian scientists of all ages — not just those who have come of
age since the fall of the Soviet Union — are sincerely inclined towards
liberal democracy and open intellectual discourse. They believe that
scientific systems based on peer-reviewed grant applications, although
fiercely competitive, are preferable to any form of prescribed knowledge
production. But an influential class of old-school academicians
and science administrators would rather return to communist secret
practices than accept the norms and standards of what they see as
Western science. Peer review? Project evaluation? Perish the thought.

It is mostly the latter group — together with Russia's omnipresent
Federal Security Service, formerly run by Putin — that has created the
stifling bureaucracy and perplexing jungle of regulations and restrictions
that many aspiring Russian scientists, let alone their foreign
collaborators, have learned to hate. The travel and security restrictions that
some Russian physicists are currently protesting against are just the most
recent example of the harassment to which the apparatchiks subject
scientists (see Nature http://dx.doi.org/10.1038/nature.2012.9921;
2012). Worse, favouritism and corruption pervade science and higher
education — there is bribery from the lowest levels, with students paying
teachers in exchange for passing exams, to the highest.

If science is to have a constructive role in shaping Russia's future,
Putin must tackle these problems as forcefully as possible. Economists
say that a key test of his leadership will be how far he is prepared to go
to reform the economy; his agenda should also include kick-starting
overdue scientific reform. Rather than relying on the advice of an
exclusive inner circle of buddies and dignitaries, as he has in the past,
Putin should set up a truly independent scientific advisory council, 
ideally involving foreign scientists, to guide him through the neces- 
sary changes. Russia's partnership with the Massachusetts Institute of 
Technology in Cambridge, which will help to set up a new research 
university in Skolkovo, near Moscow, is a first step in that direction.
The creation of a well-funded granting agency for university research,
with transparent and fair procedures, would be a perfect sequel.

Russia has its own rich traditions and intellectual culture. It need not
imitate the scientific landscape of any foreign nation, but the blindness
of parts of its scientific establishment to the necessity of change
borders on negligence. It is time for that establishment to throw
overboard the stubborn obscurantism that stands in the way of a
renaissance of Russia’s proud scientific past.


The shared burden 


A proposed change to Germany’s constitution 
is needed for the future health of the universities. 


It is hard to find anyone who can offer a sensible explanation for
why the 16 German states signed up to a reform in 2006 that prevented
the federal government from directly injecting cash into
their universities. Federal funds had previously been used to share
the considerable burden of financing infrastructure and large equipment
in universities, for which the states have general responsibility.
The reform made such sharing impossible. And that left universities,
particularly those in poorer states, to deteriorate in quality.

Both federal and state governments now agree that the situation
must be reversed — urgently, given the relative underfunding of German
universities. A proposal on 4 March by education and research
minister Annette Schavan aims to do just that.

But the required (two-word) change to the German constitution is
opposed by some political parties and certain states. Opponents say
that the reform should not be limited to universities, and should allow
the federal government to co-finance schools too — a demand that
Schavan cannot accept.

To understand the situation, one has to look back to Germany’s 1949
post-war constitution, which was designed to ensure that a dictator
could never again seize centralized power. This involved the creation of
a highly federalized country with politically strong state governments.
In particular, responsibility for education, the most sensitive political
issue, was placed exclusively in the hands of the states. The constitu- 
tion allowed the federal government to offer only short-term project 
money to universities. 

This admirable philosophy was not without problems. Universi- 
ties in the more cash-strapped states, such as Saarland in the west 
and Mecklenburg-Vorpommern in the east, could not compete with 
universities in rich states such as Bavaria and Baden-Württemberg in
the south. And frustrated successive federal governments were unable 
to exert political influence to force universities to raise their game and 
become competitive. Despite Germany’s general wealth, barely half a 
dozen of its universities regularly appear in the Shanghai Jiao Tong 
University ranking of the world’s top 100 universities. The two most 
successful ones — the Technical University of Munich (now placed 
forty-seventh) and the Ludwig Maximilians University of Munich 
(placed fifty-fourth) — are in Bavaria. The University of Heidelberg 
in Baden-Württemberg comes in third, at sixty-second.

For decades, the federal and state governments have colluded to 
make exceptions to the strict federalization of universities in ways 
— such as in infrastructure — that did not allow the federal govern- 
ment to exert political influence. A 2005 agreement to create the 
multibillion-euro Excellence Initiative — which will expire in 2017 
after two rounds — was something of an exception. The competi- 
tion for graduate schools, research clusters and the coveted title of 
‘elite university’ indirectly allowed the federal government to exert 
its political will. 

So why the 2006 decision to stop money flowing to universities 
from the federal government? Probably because the 2006 federal 
reform was a much wider-ranging attempt to clarify responsibilities
between governments on both levels that had, over the decades, been
rendered fuzzy by the introduction of innumerable exceptions to 
the divisions laid out in the 1949 constitution. Universities were a 
tiny part of this. Although experts saw the problems immediately, 
they could not persuade politicians to delay a highly complicated 
agreement to negotiate an apparent detail. 

German schools certainly need greater investment, and the federal
government is well placed to help. But in this sensitive area, the states
do not want the federal government to share in decisions on how its
donated money is spent. This is unacceptable to the government, which
until 2006 had always participated in decisions about the co-financing
of university infrastructure and equipment through its inclusion in the
German Council of Science and Humanities, a high-level committee of
scientists and politicians.

The debate over schools could involve years of painful negotiation, 
and German universities should not be made to wait. If the Social 
Democrats and Greens drop their opposition to the two-word addition 
to the constitution now, that change could be in place by March 2013. 
They should do so. The federal government could then, for exam- 
ple, directly ensure the survival beyond 2017 of successful institutes 
created in the Excellence Initiative, even if the states in which they are 
located cannot give them permanent financing. And the government 
could do even more to steer academic science in Germany towards 
sustained competitiveness — not to usurp the states’ power but to give 
them financial breathing space.


Final say 


Ongoing controversy over work at Japan’s 
Tohoku University must be resolved. 


It is not easy for universities to investigate their own researchers.
But as a case at Tohoku University shows, it is especially difficult in
Japan — particularly when the scientist concerned is the university's
president.

When concerns were raised in 2007 about irregularities in the
research of Akihisa Inoue, who specializes in making metallic glasses,
the university formed a committee to look into the matter. The committee
determined that there was no case to answer and that the matter
need not be investigated formally. But the decision satisfied few of
his doubters, who criticized the committee for being heavily populated
by university administrators whom Inoue had appointed to their
posts (but not to the committee). The university says that there was
no conflict of interest.

Four years later, and with claims of problems in Inoue’s papers
accumulating, the university formed another committee. This time
it got an eminent outsider, Akito Arima, former president of Tokyo
University, parliamentarian and minister of education, to lead. The
committee focused on retractions of six papers on which Inoue had
been an author.

The committee concluded that the duplicate submissions that led to
the retractions were an example of poor, but somewhat accepted, practice
in the field of materials science. Duplicate publications, the report
said, were bad, but “different from practices such as FFP [falsification,
fabrication and plagiarism], which clearly have a negative impact on
research.” In a formal statement, Inoue apologized and said that the
university would quickly implement the report’s recommendations.

The judgement was surprisingly light. Six retractions is a serious
matter, although they may simply indicate that Inoue — who has
more than 2,500 papers to his name — was trying to move too fast. As 
Inoue has told Nature, his laboratory did not always check its papers 
to ensure that its research was cited properly. 

Then, in April 2011, the Japan Science and Technology Agency 
(JST) formed a committee to find out whether the retractions, cor- 
rections and other allegations had undermined the results of a project 
it had funded between 1997 and 2002. It found no reason to question 
the main findings, but concluded that 5 of 27 high-impact papers 
were duplicated in other papers by Inoue’s laboratory. But neither of 
these latest committees, as both acknowledge, investigated the more 
serious allegations against Inoue concerning irreproducibility of 
results (see Nature 470, 446-447; 2011). Inoue denies any manipu- 
lation of data, and there is no evidence that he has committed any 
scientific misconduct. 

The JST, in accordance with its policies, has now requested that the 
university look into the allegations. The university says that it will do 
so, but will give no details about the composition of the committee, or 
about the investigation’s mission, scope or expected date of completion. 

Unlike countries such as the United States, Japan has no final arbiter 
for such issues. The Office of Research Integrity in Rockville, Mary- 
land, for instance, as well as many funding agencies, can aggressively 
investigate allegations. Even if they don't satisfy everyone, their deci- 
sions are final, and for the most part fair. 

The primary responsibility for investigating should always remain 
with the university, research institute or relevant scientific society. 
But Japan needs a procedure to oversee the committees that make 
these judgements. The authors of the JST report suggest that the 
Science Council of Japan (if given the necessary funding to repeat 
experiments) could fit this role. Such a body seems worthwhile, and 
it could very well have closed the arguments at 
Tohoku University that refuse to die even after 
nearly five years. Japan’s policy-makers should 
use the situation at Tohoku to create a mechanism
to handle situations such as these.


Two nations divided by a common purpose

Plans to replicate Britain’s Science Media Centre in the United States are
fraught with danger, warns Colin Macilwain.

Marks & Spencer. Oasis. Football (proper football) for the first
hundred years. The list of great British ideas to fall flat on
their faces in the United States is a lengthy one. It is a roll-call
of infamy that sprang to mind last month, when plans emerged to
establish a Science Media Centre in the United States.

The original Science Media Centre (SMC) began in London in
2002. It offers the media a clearing house for scientific briefings and
packaged quotes from scientists. On its own terms, it has been an
outstanding success. A combination of factors lies behind this impact:
the energy of the centre's staff; the backing of sponsors, including
scientific societies, major corporations and most governmental and
non-governmental research funders; and a close-knit London media circle.

Science media centres similar, but not identical, to that in London
already exist in locales as far apart as Australia, Canada and Japan.
But it seems to me, as someone who has worked as a reporter and an
editor on both sides of the Atlantic, that there are formidable obstacles
to a successful introduction of the concept to the United States.

For a start, the ‘problem’ regarding science and the public is different
in each place. The London SMC was set up because UK scientific
leaders were upset that environmentalists had successfully fought the
introduction of genetically modified food; they felt that the UK media
were too susceptible to environmental scare stories about new
technologies.

The main public-relations challenge facing science in the United
States is different — bitter social and political division over stem-cell
research, global warming, creationism and much else besides. A US
Science Media Centre would either avoid these highly partisan issues —
and face irrelevance — or step right into them, and take a level of heat
that the UK SMC has never experienced.

Second, US journalists, justifiably or not, have higher self-regard
than their British counterparts and are likely to take strong issue with
the ‘churnalism’ aspects of the SMC. Under pressure as US reporters
may be, they don't want to share ‘pooled’ quotes.

Third, the entire media picture in the United States is much larger
and more diverse. There isn’t the direct competition between titles
that obliges the main London newspapers to run so many stories every
day, in case readers miss something. Broadcasting dwarfs print media
in its US reach, and is itself now being eclipsed by a plethora of new
media — ranging in scope from an unread Tweet to The Huffington
Post — as the principal source of news.

Supporters say that the US SMC would reflect this scale and diversity,
doing most of its work
online. But it is hard to see how a digital SMC would differentiate itself
from other interest groups. 

Finally, the top echelons of the US media — what Newt Gingrich 
would call the ‘liberal media elite’ — are particularly sensitive to the 
nature of their sources of information. If an organization in Washing- 
ton or New York, co-funded by government and big business, tried to 
sell them a line, they would recoil. 

The London SMC’s narrow approach to risk assessment — if you want 
to hear about the risks of nuclear power, say, just ask your local nuclear 
engineer (see Nature 471, 549; 2011) — sits happily with the prevalent 
ethos of British journalism. This was, of course, immortalized by the 
otherwise-obscure poet Humbert Wolfe: “You cannot hope | to bribe or 
twist, | thank God! the | British journalist. | But, seeing what | the man 
will do | unbribed, there's | no occasion to.” 

Despite the fears of the SMC founders, the 
British press — led by the BBC, which treats 
the Confederation of British Industry with the 
deference the Vatican gets in Rome — is over- 
whelmingly conservative and pro-business in 
its outlook. It is quite unperturbed by the fact 
that SMC sponsors include AstraZeneca, BP, 
Coca-Cola, L'Oréal, Monsanto, Syngenta (as
well as Nature Publishing Group) but not a single
environmental non-governmental organization 
(NGO) or trade union. 

Fiona Fox, the SMC’s director, says that the 
centre operates independently of its sponsors 
and points out that none (except its host, the 
Wellcome Trust) accounts individually for more 
than 5% of its income. She adds that no NGOs 
are involved because it was their public-relations 
skills that the founders of the SMC sought to match. But the perception 
that the environmental group Friends of the Earth constitutes a big- 
ger threat to scientific truth-telling than some of the corporate names 
on the SMC’s sponsorship list is not one the US media would accept. 

Some of those considering a US centre share these concerns. They 
think that their funding model will have to rely on charitable trusts, 
not companies or government agencies. And they see the US SMC as 
a source of background information and advice to help reporters to 
get out and do their jobs: talking to their own sources and obtaining 
their own quotes. That agenda would be less ambitious — and less 
fraught with danger. 

This spring, reporters and funders in the United States will be consulted
on whether the Science Media Centre can mimic the transatlantic success
of a certain four-piece band from Liverpool. I fear, instead, that the
fate of Robbie Williams awaits. Who, you ask? Well, exactly.


Colin Macilwain writes about science policy from Edinburgh. 
e-mail: cfmworldview@gmail.com 


15 MARCH 2012 | VOL 483 | NATURE | 247 


© 2012 Macmillan Publishers Limited. All rights reserved 


RESEARCH HIGHLIGHTS


Selections from the
scientific literature


Rain slows 
seismic waves 


The recent occurrence of 
earthquakes and even heavy 
rainfall reduce the near-surface 
velocity of a type of seismic 
wave responsible for the worst 
damage during earthquakes. 
Nori Nakata and Roel 
Snieder at the Colorado 
School of Mines in Golden 
studied these ‘shear’ waves 
using seismic interferometry, 
which measures how the waves 
propagate from depth to the 
surface. The authors analysed 
the average velocities of shear 
waves in nearly 112,000 
quakes — most of them minor 
— recorded between 2000 and 
2010 by Japan's network of 
seismic stations. They found 
that, three months after major 
earthquakes, wave velocities 
recorded at nearby stations 
were 3–4% lower than normal.
In southern Japan, shear- 
wave propagation was also 
slower during rainy summers 
than drier seasons, possibly 
because of the increase in fluid 
pressure that occurs when 
the ground is infiltrated by 
rainwater. 
J. Geophys. Res. http://dx.doi.org/10.1029/2011JB008595 (2012)


PHYSIOLOGY 


Sawfish snouts
sense and stab


The serrated snouts of sawfish
have an unusual double role:
the fish use them to both
detect and disable prey.

A number of fish species
have elongated ‘noses’ that they
use either to detect or to attack
food. In studies of captive
juvenile freshwater sawfish
(Pristis microdon), Barbara
Wueringer at the University
of West Australia in Crawley
and her colleagues show that
sawfish do both.

The animals have a network
of electrical sensors in their
saws (pictured), and the team
found that a weak electric
dipole suspended in the water
to mimic the field created by a
prey item elicits saw-waving.
But the fish were also filmed
using their saws to slice mullet
and chunks of tuna before
consumption.
Curr. Biol. 22, R150-R151 (2012)


Outback palms were planted


The Australian outback’s only palm trees,
thought to be living fossils of the ancient
Gondwanan rainforest that covered the region
15 million years ago, may in fact be descended
from transplants carried by humans.

Toshiaki Kondo at Hiroshima University and
his colleagues compared DNA samples from four
populations of the outback’s Livistona mariae,
which give their name to the Palm Valley Oasis,
with those of other Australian palms. They found
that the species is genetically almost identical to
its northern relation, Livistona rigida — meaning
that the two are not distinct species. Genetic
analysis revealed that the populations started to
diverge between 7,000 and 31,000 years ago, a
timeline that overlaps with intermittent human
migrations from northern to central Australia.
The authors suggest that Aboriginal people
may have brought the palm seeds with them as a
food resource or raw material.
Proc. R. Soc. B http://dx.doi.org/10.1098/rspb.2012.0103 (2012)


ORGANIC ELECTRONICS


Sterilization-
ready transistors


The striking flexibility of
organic electronics means that
they hold immense promise for
use in biomedical devices such
as catheters. But most organic
materials melt or degrade when
heated to the temperatures
needed to sterilize materials, as
required for medical use.

Takeo Someya at the
University of Tokyo and his
colleagues have built a flexible
organic thin-film transistor
that is heat resistant and works
at the low voltages that might
be used in the human body.
They heated the device, which
consists of a thiophene-based
aromatic active layer with
aluminium oxide and alkyl
phosphonic acid insulating
layers, up to 150°C and found
that its electrical characteristics
remained stable.
Nature Commun. 3, 723 (2012)


CANCER GENETICS 


Many genomes in 
one tumour 


Separate parts of a single 
tumour can contain distinct 
genetic mutations, and 
clinicians might suggest
contradicting prognoses or
treatments on the basis of 
biopsies from different areas. 
Charles Swanton at Cancer 
Research UK’s London 
Research Institute and his 
colleagues analysed this 
diversity by reconstructing 
the evolution of four kidney 
cancers, sampling several parts 
of each primary tumour, as well 
as secondary ones from organs 
to which the cancer had spread. 
In one patient, only one- 
third of the total mutations 
were found in all the samples, 
and one-quarter were unique to 
just one. This has implications 
for biomarker analysis and 
cancer-genome studies, which 
rely on single biopsies. It 
might also explain why many 
cancer treatments eventually 
stop working: they affect only 
part of a tumour, allowing the 
remainder to bounce back. 
N. Engl. J. Med. 366, 883-892 
(2012) 


Venusian hot flow 
anomalies 


A giant aneurysm of plasma 
distorted Venus’s atmosphere 
on 22 March 2008, a space- 
weather event known as a hot 
flow anomaly (HFA). Such 
events occur when electric 
fields associated with the 
Sun’s solar wind create a fast- 
growing bulge that is filled 
with hot ions from the bow 
shock, a boundary between 
the solar wind and the planet. 

Venus joins Earth, Saturn 
and Mars as planets where 
HFAs have been observed, say 
Glyn Collinson at the Goddard 
Space Flight Center in 
Greenbelt, Maryland, and his 
colleagues, who used data from 
the European Space Agency's 
Venus Express satellite to 
identify the 2008 event. 

On Earth, HFAs can cause 
auroras. The authors suggest 
that HFAs would be even 
more disruptive to Venus’s 
atmosphere because the planet 
lacks a magnetic field, meaning 
that the bow shock, and so the 
HFA, would be much closer in. 
J. Geophys. Res. http://dx.doi.org/10.1029/2011JA017277 (2012)


IMMUNOLOGY 


Controlling 
natural killers 


Immune cells long considered 
lowly foot soldiers on the front 
line against bacteria and fungi 
turn out to have a commanding 
role. Researchers report that 
neutrophils, the most common 
white blood cells in humans, 
control the activity of another 
antimicrobial white blood cell 
— the natural killer cell. 
Sophie Ugolini and Eric 
Vivier of Aix-Marseilles 
University in France and 
their colleagues isolated 
a mutant mouse whose 
natural killer cells responded 
poorly to stimulation. The 
mutant, they found, lacked 
mature neutrophils. Without 
neutrophils to guide them, 
mouse natural killer cells 
failed to mature or function 
properly. The same was true 
of natural killer cells in human 
patients with low neutrophil 
counts, a condition known as 
neutropenia. 
J. Exp. Med. http://dx.doi.org/10.1084/jem.20111908 (2012)


What killed the 
big beasts? 


During the past 100,000 years, 
many of Earth's largest animals 
— including ground sloths, 
mammoths, woolly rhinos and 
sabre-toothed cats — became 
extinct. Scientists have debated 
why for decades, with climate 
change and hunting by humans 
the chief suspects. Graham 
Prescott, David Williams and 
their group at the University of 
Cambridge, UK, have created 
a model of unprecedented 
geographical breadth — and 
concluded that it took both 
factors to seal the beasts’ fate. 
The researchers modelled 
extinctions in North America, 
South America, Palaeoarctic 
Eurasia, Australia and New 
Zealand, running simulations 
with climate data from ice cores 
and thousands of plausible 
combinations of human-arrival
and species-extinction
times, reflecting the large
uncertainty in both these
estimates. The recipe that
best predicted the pattern
of extinctions included
both climatic and human
ingredients.
Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1113875109 (2012)


COMMUNITY CHOICE


James Crow on sabotaging sperm


A recent paper identifies a cause for sperm cells
with puzzlingly high rates of mutation that
cause a particular thyroid cancer syndrome.
But the Perspective that accompanied it was
viewed more often than the work itself, perhaps because its
author, James Crow, a distinguished geneticist at the University
of Wisconsin, Madison, died in January, aged 95.

Those who read Crow’s last paper found the cause of the
high mutation rate summarized in his clear and pleasant
prose. Rather than dividing asymmetrically — to produce one
sperm and one proto-sperm cell like the parent — some cells
‘cheat’ and produce two proto-sperm cells instead. Each of
these can divide asymmetrically, doubling the cheat’s number
of descendants and creating clusters of mutations linked to
the effect.
PLoS Genet. 8, e1002535 (2012)

For an obituary of James Crow, see go.nature.com/ty6xi5


MATERIALS 


A new way to
3D shapes 


Materials that can change their 
shape on demand have many 
applications, and, thanks to a 
team in the United States, an 
extra means of production. 
Christian Santangelo, 
Ryan Hayward and their 
colleagues at the University 
of Massachusetts at Amherst 
report that they have devised a 
way to produce polymer sheets 
that can transform into pre- 
determined three-dimensional 
shapes and back again. 

The researchers created 
gel sheets from polymers of 
N-isopropylacrylamide and 
benzophenone, a molecule 
that creates crosslinking 
between polymers when 
exposed to ultraviolet (UV) 
light. These sheets swell in 


water, but shrink when heated. 
The team patterned the gel 
using a mask and an initial 
low dose of UV to define the 
polymer’s overall shape, then 
used a higher dose through a 
different mask to create highly 
crosslinked regions in which 
swelling is reduced. By allowing 
more swelling in some areas 
than others, the authors created. 
objects that can switch between 
sheets and spheres when heated 
and cooled (pictured), as well 
as others that can reversibly 
form caps, cones and more- 
complex surfaces. 
Science 335, 1201-1205 (2012) 
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SEVEN DAYS 


| _RESEARCH 
Telescope rivals 


A scientific panel has narrowly 
recommended South Africa 
over Australia as the best 

site for the proposed Square 
Kilometre Array (SKA), a 
US$2.1-billion international 
radio telescope (see Nature 
480, 308-309; 2011). But the 
project’s voting member states 
have yet to make the final call 
on where the telescope will go. 
Despite the SKA Site Advisory 
Committee’s recommendation, 
both sites are still in contention, 
a source told Nature. The final 
choice could come as soon as 

4 April. See go.nature.com/ 
gwfejy for more. 


Tevatron ¥ Higgs 


Before it was shut down last 
year, the Tevatron particle 
collider at Fermilab in Batavia, 
Illinois, gathered data from 
proton-antiproton collisions 
that seem to support the 
existence of the elusive Higgs 
boson predicted by the 
standard model of physics. 
The results, announced on 

7 March at the Moriond 
conference in La Thuile, 

Italy, do not reach a level that 
physicists would consider 
statistically significant. But 
they are consistent with 
reports in 2011 of a Higgs 
particle with a mass of around 
125 gigaelectronvolts, from 
experiments at the Large 
Hadron Collider at CERN, 
Europe’ high-energy physics 
lab near Geneva, Switzerland. 
See go.nature.com/cd3rma 
for more. 


POLICY 


FDA improvement 
Scientists who work for the US 
Food and Drug Administration 
have greater respect for their 
leaders and more confidence 

in the agency’s future than they 
did five years ago, according 

to asurvey of 997 researchers 


The news in brief 


Success for China’s neutrino test 


The final parameter needed to quantify the rate 
at which neutrinos change (‘oscillate’) from 
one form to another has been precisely 
measured by a neutrino-detection experiment 
in southern China. The Daya Bay Reactor 
Neutrino Experiment (pictured), on the 
Dapeng Peninsula near Hong Kong, monitors 
electron antineutrinos produced by nearby 


(see go.nature.com/cv5gi6) 
published on 7 March by the 
Union of Concerned Scientists 
in Washington DC. But 
researchers are still uneasy 
about their rights to publish 
work that might be considered 
contentious, and to talk to the 
press and public about their 
findings, the survey showed. 
They were also worried 

about corporate and political 
influence on the agency’s 
decisions. 


Antarctic repair 

The Brazilian government last 
week laid out plans to rebuild 
and modernize the country’s 
Antarctic research station, 
which was mostly destroyed 
by a fire on 25 February.
No money has yet been
allocated for construction, 
but the science ministry 

has committed more than 
US$5.5 million in new grants 
to rebuild research projects, 
and funds to replace lost 
scientific equipment. 

See go.nature.com/wluzf1 
for more. 


Oil-spill restoration 
Restoration efforts in the Gulf 
of Mexico are likely to receive 
around 80% of the pollution 
fines that the US government 
collects from oil giant BP as 

a result of the 2010 oil spill in 
the region. On 8 March, the 
US Senate passed a bill (the 
RESTORE the Gulf Coast 
Act) that would approve this 
nuclear power plants. Its measurement,
reported on 8 March, pips efforts by reactors 

in Japan, France and the United States. It 

also validates experiments that plan to 
compare the rates of neutrino and antineutrino 
oscillations, to test the symmetry between 
matter and antimatter. See go.nature. 
com/9yssr1 for more. 


allocation of funds, which 
would otherwise go to the US 
Treasury. Federal fines, which 
are separate from private 
settlements, are expected to 
range from US$5 billion to 
$20 billion. The bill is now 
attached to a transportation 
bill that awaits passage 
through Congress. 


Water scarcity 
The world needs to rethink 
its policies on managing 
freshwater resources, 
which are being severely 
threatened by climate change 
and a growing population, 
warned the United Nations 
Educational, Scientific and 
Cultural Organization on 
12 March. The Paris-based 


organization launched

its triennial World Water 
Development Report at the 
start of a meeting to discuss 
the problem in Marseilles, 
France. See page 256 for more. 


PEOPLE 
Brain prize 


Two women who studied 
genes involved in hearing 
share this year’s Brain Prize, a 
€1-million (US$1.3-million) 
annual award given to 
European neuroscientists 
since 2011 by the Grete 
Lundbeck European Brain 
Research Foundation in 
Copenhagen. Christine Petit 
of the College of France and 
the Pasteur Institute in Paris 
identified genes underlying 
different types of deafness. 
Her co-winner, Karen Steel 
of the Wellcome Trust Sanger 
Institute in Hinxton, UK, 
mapped key mouse genes 
involved in the function of 
cochlear hair cells, which pass 
sounds to the brain. 


Nobel chemist dies 
Frank Sherwood (‘Sherry’) 
Rowland (pictured), 
co-winner of the 1995 Nobel 
Prize in Chemistry, died on 

10 March aged 84. One of the 
world’s most famous climate 
scientists, in 1974 he reported 
with Mario Molina that 
chlorofluorocarbons (CFCs) 
were helping to destroy the 
ozone layer. Rowland advocated 


To reduce pressure on fish stocks 
to a sustainable level by 2015, the 
worldwide fishing effort must be 
scaled back by 36-43% of 2008 
levels, according to an analysis 
by researchers at the Food and 
Agriculture Organization of the 
United Nations in Rome (Y. Ye 
et al. Fish Fish. http://doi.org/ 
hq8; 2012). That would mean 
cutting the jobs of between 

12 million and 15 million fishers 
(see chart), and would cost up 

to US$358 billion — but it 
would increase production in 
the long term. 


banning the chemicals; his work 
led to the Montreal Protocol 

in 1987. He continued to study 
atmospheric pollution until his 
death, and won the Nobel prize 
for his ozone research. Rowland 
was also a founder member of 
the University of California, 
Irvine. See go.nature.com/ 
hpmb§j for more. 


Ocean dives 


Mixing showbiz, adventure 
and science, movie director 
James Cameron says that he 
intends this month to dive 
solo to the bottom of the 
11-kilometre-deep Mariana 
Trench in the Pacific Ocean. 
On 8 March, Cameron, 
together with the US 

National Geographic Society, 
announced his plans — as part 
of a documentary — to take the 
first manned visit to the trench 
since the Trieste submersible 
visited the Challenger Deep 
region in 1960. British 
billionaire entrepreneur 


Richard Branson is also hoping 
to reach the trench, although 
his Virgin Oceanic mission is 
suffering delays. 


Retraction spree 


A Japanese anaesthesiologist 
who published work without 
ethics approval from an 
institutional review board has 
been fired by his university 
and had eight papers retracted, 
with many of his others under 
suspicion. Toho University in 
Tokyo said on 6 March that 

it had dismissed Yoshitaka 
Fujii on 29 February after an 
investigation. On 7 March, the 
journal Anesthesia & Analgesia 
published an expression of 
concern about 24 other papers 
by Fujii, and said that its 
editors had spent two years 
following up allegations of 
research misconduct — one 
made as early as 2000 but not 
followed up until recently. 


BUSINESS
Indian generics 


In a landmark ruling, the
Indian Patent Office has
allowed a domestic firm to
sell a generic version of a
patent-protected anti-cancer 
drug so that the treatment 

is affordable. Under a 
compulsory licence granted 
on 12 March, German 
pharmaceutical giant Bayer 
must allow Natco Pharma in 
Hyderabad to make and sell 
its patented drug Nexavar 


TOO MANY FISHERS, NOT ENOUGH FISH 


To meet 2015 sustainable-development targets for healthy fish stocks, 
between 12 million and 15 million fishers will have to lose their jobs. 
*WSSD: The 2002 World Summit on Sustainable Development


SEVEN DAYS | THIS WEEK 


19-23 MARCH 

The latest developments 
in nuclear-powered 
spaceflight are 
discussed at the 
Lunar and Planetary 
Science Conference 
in The Woodlands, 
Texas. It marks the 
50th anniversary of 
the launch of the first 
nuclear-powered 
satellite. 
go.nature.com/inloal 


20-21 MARCH 
Chemists, biologists 

and regulators meet in 
Brussels to discuss the 
transition to animal-free 
toxicity testing. 
go.nature.com/ptvih2 


(sorafenib) in return for 

6% royalties on net sales. 
Natco will charge patients 
8,800 rupees (US$176) a 
month; Bayer was charging 
280,000 rupees. Bayer may 
challenge the licence, which 
is permitted under the 
World Trade Organization's 
Agreement on Trade-Related 
Aspects of Intellectual 
Property Rights. 


Space weather 


A strong geomagnetic storm 
hit Earth on 9 March last 
week, three days after the 

Sun spat out a large solar

flare. No serious damage 

was reported, although such 
solar flares can cause radio 
blackouts and affect satellite 
navigation systems. About 
200 geomagnetic storms 

of this intensity (G3) are 
expected for every 11-year 
solar cycle, according to 

the US government’s Space 
Weather Prediction Center in 
Boulder, Colorado. As the Sun 
approaches its solar maximum 
in 2013, more are expected. 


Modellers look
to the past to prepare for UN 
report p.254 


An X-ray mission to 
reveal the black hearts of 
galaxies p.255 


Putting the
world’s water crisis on
the map for Rio p.256

The quest to 
create a caffeine-free 
coffee bean p.264 
Vladimir Putin is proposing to set up world-class institutes and broaden distribution of research funding.


Putin promises 
science boost 


But Russian researchers are sceptical of ambitious schemes. 


BY QUIRIN SCHIERMEIER 


As a teary-eyed Vladimir Putin celebrated
a triumphant win in the Russian presidential
elections last week, the country’s
scientists expressed mixed feelings about the 


implications for research. 
Putin, leader of the United Russia party and 


currently prime minister, will in May replace 
Dmitry Medvedev as president — a post that 
he previously held in 2000-08. Science had 
only a marginal role in the election campaign, 
but Putin outlined his ambitions for research 
in the newspaper Vedomosti on 30 January. 
His government, he wrote, will aim to establish
several “world-class” research universities
by 2020. Many rectors and chancellors of
Russian universities are delighted, and cam- 
paigned for Putin. Indeed, there are high hopes 
for the planned Skolkovo Institute of Science 
and Technology, a graduate research university 
set to be established in a science city outside 
Moscow, in partnership with the Massachu- 
setts Institute of Technology in Cambridge. 
But critics say that Putin’s goal is out of reach, 
given that there are currently no Russian insti- 
tutions in the top ranks of world university 
league tables. 

Putin also promised a substantial increase in 
public funding of basic and applied research. 
The Russian science funding system, often 
criticized for cronyism and a lack of fair grant- 
assessment procedures, must become more 
transparent and open to genuine competition, 
he wrote. Some researchers are cautiously opti- 
mistic, acknowledging that conditions have 
improved in recent years, thanks to modest 
funding and salary increases. “I do think that
science plays a serious role in Putin’s thinking
about Russia’s future,” says Raul Gainetdinov,
a neuroscientist at the Italian Institute of Technology
in Genoa, who left Russia in 1996. “If
opportunities at home were to really improve,
I’d be more than happy to return.”

But although Russian research budgets 
have increased since their near-collapse in the 
1990s, the country’s output of scientific papers 
has declined since 2000, putting it below India, 
Australia and Canada, according to a 2010 
analysis by global data-provider Thomson 
Reuters. Funding opportunities are poor, with 
only a select few scientists able to benefit from 
flagship projects such as the space programme, 
or the ‘mega-grants’ launched in 2010 to attract 
leading researchers to the country (see Nature 
465, 858; 2010). For the majority of Russian 
scientists, project grants are almost impossible 
to get. 

“I receive a salary from the Russian Academy
of Sciences, but there wasn’t any money
available last year for buying equipment, doing 
field research or travelling to international 
conferences,” says Leo Borkin, a herpetologist 
at the academy’s prestigious Zoological Insti- 
tute in St Petersburg and former head of the 
St Petersburg Association of Scientists and 
Scholars. “Young scientists here have basically 
no opportunity to make ties with colleagues 
from North America, Europe or Asia,” he says.
“It’s a very sad and stupid situation.”

Putin has promised to increase the
amount of grant money distributed by
funding agencies from around 15 billion 
roubles (US$500 million) a year to 25 billion 
roubles by 2018. The average size of grants 
awarded by the Russian Foundation for Basic 
Research, for example, currently some 350,000 
roubles (US$12,000) per year, will be “made
comparable to Western grants”, he wrote.
Researchers welcome more cash, of course, 
but the plan implies that the number of grants 
might actually go down, notes Konstantin 
Severinov, who runs independent groups 
at the Russian Academy of Sciences (RAS) 
institutes for molecular genetics and gene 
biology in Moscow, and Rutgers University 


in New Brunswick, New Jersey. 
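
Severinov’s concern comes down to simple arithmetic: the number of grants a budget can support is the budget divided by the average grant size. The sketch below is illustrative only. It assumes, for simplicity, that the whole pot is handed out in average-sized grants, and the 1-million-rouble figure for a larger grant is a hypothetical placeholder, not a number taken from Putin’s article.

```python
# Figures quoted in the article, in roubles per year.
BUDGET_NOW = 15e9         # grant money distributed by funding agencies today
BUDGET_2018 = 25e9        # promised annual total by 2018
AVG_GRANT_NOW = 350_000   # current average grant from the basic-research foundation

# Hypothetical value for illustration only: the article gives no target size.
ASSUMED_FUTURE_GRANT = 1_000_000


def grants_fundable(budget: float, avg_grant: float) -> float:
    """Number of average-sized grants that a budget could cover."""
    return budget / avg_grant


def break_even_grant(budget_now: float, budget_future: float,
                     avg_grant_now: float) -> float:
    """Largest future average grant that keeps the grant count from falling."""
    return avg_grant_now * budget_future / budget_now


count_now = grants_fundable(BUDGET_NOW, AVG_GRANT_NOW)
count_future = grants_fundable(BUDGET_2018, ASSUMED_FUTURE_GRANT)
break_even = break_even_grant(BUDGET_NOW, BUDGET_2018, AVG_GRANT_NOW)

print(f"Grants fundable now:          {count_now:,.0f}")
print(f"Grants fundable in 2018:      {count_future:,.0f} (assumed grant size)")
print(f"Break-even average grant:     {break_even:,.0f} roubles")
```

Under these simplifying assumptions, the count falls as soon as the average grant grows faster than the budget, that is, by more than a factor of 25/15.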

Putin also intends to break the long-stand- 
ing dominance of the academy, which employs 
around 50,000 scientists at more than 400 
research institutes. He plans to redistribute 
some of the RAS’s budget — currently about 
50 billion roubles per year — to give other 
institutions and universities more money, as 
a key part of a sweeping ten-year science plan 
that he has asked the academy to develop. 

Last month, the RAS administration made 
a start by asking leading scientists at its insti- 
tutes to compile lists of research results that 
they expect to produce by 2030, including cost 
estimates. “This is just not how science works,” 


says Severinov. To him, the unrealistic request 
is typical of the stifling bureaucracy preva- 
lent in the Russian science system. “There are 
undeniably some advances in how science 
is run in this country. Alas, there are lots of 
missed opportunities as well,” he says. 
Mikhail Gelfand, a Moscow-based bio- 
informatician who in December spoke to a 
100,000-strong crowd of anti-Putin protest- 
ers, believes that, under Putin, a “background 
of omnipresent bureaucracy and corruption” 
will continue to hamper any Russian science 
revival. “It would be naive to expect that science 
alone somehow could, miraculously, blossom 
in this atmosphere,” he says. SEE EDITORIAL P.245




CLIMATE SCIENCE 


Forecasters look back in time 


As the next IPCC assessment nears, scientists use palaeoclimatic data to hone their models. 


BY JEFF TOLLEFSON 


It is hard to imagine beaches in northern
Greenland or driftwood washing up on
islands of Canada’s Arctic archipelago, but
both were a reality some 6,000 years ago. “At
least seasonally, those areas must have been
ice-free,” says Gavin Schmidt, a climate modeller
at NASA’s Goddard Institute for Space
Studies in New York. The warming was caused
primarily by cycles in Earth’s orbit, not a spike
in greenhouse-gas levels, but climate modellers
are revisiting the long-gone era as they prepare
to deliver their next global forecast.

The temperate mid-Holocene epoch is one of
three episodes included in the World Climate
Research Programme’s Coupled Model Intercomparison
Project, which will form the basis
of next year’s fifth assessment by the Intergovernmental
Panel on Climate Change (IPCC).
Simulating palaeoclimates is a challenge, but
many scientists say they have enough data
from sources such as marine-sediment cores,
tree rings and cave deposits to evaluate various
aspects of their models. “It’s important to check
the ability of our climate models to simulate a
different climate, because our future world
is also quite different from the present,” says
Masa Kageyama, a palaeoclimate modeller at
the Pierre-Simon Laplace Institute near Paris.

Owing, in part, to past computational limi- 
tations, palaeoclimate simulations have never 
been systematically factored in to the IPCC 
modelling effort. With a July deadline for sub- 
mitting papers approaching, members of the 
IPCC’s physical-science working group gath- 
ered in Honolulu, Hawaii, for a pair of work- 
shops, held over the past fortnight, to discuss 
palaeoclimate modelling and the range of other 
simulations on which their assessment depends. 
Scientists presented early results from the latest 
generation of climate models, which take into 
account factors such as the impact of land use 
and vegetation, as well as an increasingly sophis- 
ticated treatment of atmospheric physics and 
chemistry. The simulations explore the effects 
of a broad range of potential future greenhouse- 
gas concentrations, including an extreme case 
in which the amount of carbon dioxide rises to 
more than three times its current level. 

Some wondered whether the increased complexity
could add to the uncertainties of the new
models and create a larger range of climate projections,
but Jerry Meehl, a climate scientist at
the National Center for Atmospheric Research
in Boulder, Colorado, and one of the workshop
organizers, says the spread of projections is
“roughly the same” as that produced by the previous
generation of models. Ron Stouffer, a climate
researcher at the National Oceanic and
Atmospheric Administration’s Geophysical
Fluid Dynamics Laboratory in Princeton,
New Jersey, says that his team’s model has
already delivered surprises on the increase
in atmospheric carbon dioxide levels. “It turns
out that land-use changes, right up to about
1950 or even 1970, were as large a player as
fossil-fuel emissions were,” he says. “And even
today they are not trivial.”

More than 20 groups around the world are 
still processing and uploading modelling data 
onto a server network that will be accessible to 
everyone involved in the exercise. So far, more 
than 2 million files and a petabyte of data have 
been uploaded. Organizers expect that amount 
to triple, taking it to roughly 100 times that 
amassed during the last round of modelling 
for the IPCC’s fourth assessment in 2007. The 
amount of data is creating new challenges in 
terms of information management, says Karl 
Taylor, who works with modellers on experi- 
ment design at Lawrence Livermore National 
Laboratory in Livermore, California. “We’re
seeing an awful lot of results,” Taylor says, “but
we’re also up against deadlines, and everyone
is a bit frantic.”
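
The data volumes quoted above imply a rough size for the previous archive, which the article does not state directly. The short calculation below is a back-of-envelope sketch: it treats “a petabyte” as exactly 1 PB, uses decimal units (1 PB = 1,000 TB), and takes “triple” and “roughly 100 times” at face value, so the result is an order-of-magnitude estimate only.

```python
# Rounded figures from the article; all are approximate.
UPLOADED_PB = 1.0          # data uploaded so far ("a petabyte")
GROWTH_FACTOR = 3          # organizers expect the amount to triple
RATIO_TO_PREVIOUS = 100    # final archive vs. the 2007 modelling round
TB_PER_PB = 1000           # decimal convention, an assumption

expected_pb = UPLOADED_PB * GROWTH_FACTOR
implied_previous_tb = expected_pb / RATIO_TO_PREVIOUS * TB_PER_PB

print(f"Expected final archive: about {expected_pb:.0f} PB")
print(f"Implied 2007-round archive: about {implied_previous_tb:.0f} TB")
```

On these rounded inputs the earlier round works out to a few tens of terabytes, which gives a sense of why information management has become a new challenge.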


ASTROPHYSICS 
Spacecraft aims to expose
violent hearts of galaxies 


Low-cost mission will tap into the unexplored upper reaches of the X-ray spectrum. 


BY ERIC HAND 


Who would have thought that a ringside
seat at some of the Universe’s
most extreme events could come 


cheap? But by the standards of space-based 
astronomy, the NuSTAR telescope that NASA 
plans to launch as early as this month has 
a modest budget, US$165 million. Yet 
it will be sensitive to the high-energy 
photons produced at the turbulent 
thresholds of supermassive black holes. 
Due to be lofted into orbit by a Pegasus 
rocket launched in mid-air from a carrier jet, 
NuSTAR (Nuclear Spectroscopic Telescope 
Array) is taking aim at an under-explored region 
of the spectrum. It will detect ‘hard’ X-rays at 
5-80 kiloelectronvolts, an energy range between 
the softer, lower-energy X-rays probed by tele- 
scopes such as the Chandra X-ray Observatory 
and the γ-rays measured by satellites such as the
Fermi telescope. The spacecraft, which features 
an array of technical innovations, “will be the 
first mission that can resolve these high-energy
X-rays”, says Fiona Harrison, an astronomer at
the California Institute of Technology in Pasa- 
dena and the mission’s principal investigator. 
Hard X-rays are notoriously difficult to 
focus, because they tend to penetrate rather 
than reflect off mirrors, even those coated 
with dense metals such as gold and iridium. So 
Harrison and her colleagues have developed a
novel material: a sandwich of hundreds of thin
metallic films. The thickness of each layer is
tuned to reflect photons of a specific energy.
Each layer reflects a small number of photons,
but they add up to produce a strong reflection.
“That’s the key technical breakthrough,” says
Harvey Tananbaum, an astronomer at the
Harvard-Smithsonian Center for Astrophysics
in Cambridge, Massachusetts, and director
of the Chandra telescope. “Nobody has been
there with this kind of sensitivity.”


GOING TO EXTREMES


Compared with other X-ray telescopes, NuSTAR
has a larger collecting surface at higher energies.

Still, the X-rays are so powerful that they 
can be reflected only at glancing angles, and 
the mirrors must therefore be arranged in 
conical shells, like nested Russian dolls, which 
funnel the high-energy photons into a detector 
at the end of the structure. With 133 of these 
mirror shells in each of two units, NuSTAR will 
be more sensitive than the Chandra telescope 
(see ‘Going to extremes’), which has only four 
mirror shells — although, because they are 
stiffer and more finely polished, Chandra can 
perceive objects in greater detail. 

Grazing-incidence optics means that a long 
focal length is needed — and, convention- 
ally, a bulky, costly spacecraft. The $2-billion 
Chandra could barely squeeze into the cargo bay 
of the Space Shuttle. NuSTAR, at 10 metres, will 
be just as long — but it has to fit on the much 
smaller, cheaper Pegasus launcher. The solu- 
tion: the telescope is built around a folded-up 
truss that will grow to full length once NuSTAR 
is deployed in low-Earth orbit. For astronomers,
the 26 minutes while NuSTAR unpacks itself
will be the most anxious phase of the mission.


A deployable mast will allow the NuSTAR space telescope to image high-energy X-rays.

The telescope will focus on active galactic 
nuclei (AGN), the luminous hearts of distant 
galaxies, in which highly energized particles 
fling off X-rays as they whirl around super- 
massive black holes. Because NuSTAR is sen- 
sitive to the most penetrating X-rays, it should 
be able to spot several hundred new AGN that 
are shrouded from other detectors by dust and 
hot gas. That should help astronomers to get a 
handle on how big and common AGN are, says
Harrison, which in turn will help to illuminate a 
major cosmic puzzle: what role do they have in 
the growth and evolution of their host galaxies? 

The European Space Agency’s XMM- 
Newton mission, another large X-ray telescope, 
will join forces with NuSTAR to probe AGN. 
The two spacecraft will look for distortions in 
the X-ray spectra resulting from the spin of the 
central black hole. These spin rates, in turn, 
could offer insight into how the black holes in 
some AGN grew to masses of billions of Suns — 
whether by swallowing other AGN in galactic 
mergers or by steadily accumulating material 
from the host galaxy. XMM-Newton is devoting 
10% of its guaranteed observation programme 
time to joint observations with NuSTAR. 

“It shows how important this mission is to
the community,” says Norbert Schartel, the
project scientist for XMM-Newton at the European
Space Astronomy Centre in Madrid.
ENVIRONMENT


Water under pressure 


A UN analysis sets out global water-management concerns ahead of Earth Summit. 


BY NATASHA GILBERT


Water should be at the top of the
agenda for the Earth Summit in Rio
de Janeiro, Brazil, in June, a United
Nations report urges.
The fourth World Water Development Report
by the UN Educational, Scientific and Cultural
Organization (UNESCO), launched at the
World Water Forum in Marseilles, France, on
12 March, notes that industry, agriculture and
booming urban populations are putting Earth’s
water supplies under unprecedented pressure
(see graphic). Hundreds of millions of people
do not have access to clean water, leaving them
at risk from waterborne diseases. Without
prompt action to improve water-management
policies, the report says, a global crisis looms.


WORLD’S THIRSTIEST


Just three nations (India, China and the United States) together use about one-third of the roughly
4,000 km³ of water extracted globally each year. In general, water demands in developed countries have
been declining during the past 20 years, mostly as a result of more efficient use of water resources.

TOP 7 GLOBAL WATER CONSUMERS
1. INDIA 2. CHINA 3. UNITED STATES 4. RUSSIA 5. INDONESIA 6. NIGERIA 7. BRAZIL
Rest of world 52


Although the document contains a plethora
of facts and figures, its authors argue that a lack
of reliable data on water quality and usage
has become a stumbling block for efforts to
strengthen policies and enforce regulations.
“You cannot properly manage something that
you don’t know about,” says Olcay Unver, coordinator
of the UN World Water Assessment
Programme. But closing the knowledge gap
will be expensive: building a gauging station 
to measure a river's flow can cost more than 
US$1 million, for example, and the expense 
of ongoing operation can be difficult for poor 
countries to justify. The report recommends 
increasing the use of remote-sensing technolo- 
gies to monitor water quality, but notes that 
these will never completely replace informa- 
tion gathered on the ground. 

The report also focuses on the burgeon- 
ing demands of agriculture. Food production 
already consumes more than two-thirds of 
the world’s extracted water, and food demand 
is expected to rise by 70% by 2050, owing to 
population growth. Research into improving 
crop yields and drought tolerance will help 
nations to meet needs while using water more 
efficiently. 

The report concludes that policy-makers 
must balance the requirements of agriculture 
and industry with the need for sustainable 
sources of clean drinking water by developing 
integrated policies that satisfy all three sectors. 
Michel Jarraud, chairman of UN-Water, a
grouping of 28 UN organizations including
UNESCO, says that the group will tell leaders
at the Rio summit that “the challenges, risks
and uncertainties blocking the road to sustainable
development require a collective response
by the whole international community”.
ON THE FARM


Agriculture consumes almost 70% of all extracted water. Animal husbandry is the
most water-intensive aspect of farming, and causes the greatest disparity in water
consumption between the developed and developing worlds.
The United States leads the world in per capita water use attributable to animal
products, with each person consuming the equivalent of about 1,200 m³ per year.


DRILLING DOWN


Countries are increasingly meeting demand by extracting water from non-renewable
underground sources. Groundwater extraction has tripled in the past 50 years. India
has had the largest growth, jumping from less than 25 km³ in 1950 to 250 km³ in
2010: about one-quarter of the global total (other countries shown for comparison).
DYING FOR A DRINK


Nearly 1 billion people have no access to safe and improved water sources such as boreholes, protected wells, springs and rainwater 
collections. More than 80% of the world’s waste water is not collected or treated, causing millions of deaths from waterborne diarrhoeal 
diseases every year in the developing world. Urban settlements are the main source of pollution, and the challenge will grow as the 
world’s urban population almost doubles to 6.3 billion by 2050. 
China is investing heavily in research and technology to modernize its agriculture.


China’s budget 
backs science 


Yet reforms to funding systems and more support for basic 
research are needed, say scientists. 


BY JANE QIU IN BEIJING 


The Chinese government’s promise
last week that researchers will enjoy
another year of increased funding was
not unexpected, given the country’s relatively
buoyant economy and high regard for science.
But some scientists fear that too little money
will be spent on good-quality basic research,
and that too much will be wasted in a funding
system in which good connections often
trump academic expertise.

In a speech at the opening session of the 
annual National People’s Congress, China's 
Premier, Wen Jiabao, lauded science and tech- 
nology as key drivers of economic growth 
and individual prosperity, and backed up the 
rhetoric with hard cash. This year, central- 
government expenditure on science and tech- 
nology is set to rise to 228.5 billion renminbi 
(US$36.1 billion), a 12.4% increase on last 
year’s spending, which slightly trails the coun- 
try’s overall projected budget increase of 13.7%. 
Of that science budget, 32.5 billion renminbi 
will go towards basic research, a 10.1% increase 
on the 29.5 billion renminbi spent in 2011. Wen 
also stressed the government's continuing com- 
mitment to improving agriculture, promising 


an additional 10.1 billion renminbi — a 53% 
rise on last year — for developing new agricul- 
tural technologies and modernizing China’s 
seed industry. Further details of funding allo- 
cations will emerge in the coming months, but 
sources close to government say that other 
areas likely to receive increased support include 
information technology, drug discovery, regen- 
erative medicine, renewable energy and the 
exploitation of mineral and fuel resources. 

In the past decade, China's total expenditure 
on research and development (R&D) increased 
by about 20% per year. In 2009, it overtook 
Japan to become the world’s second-largest 
investor in R&D after the United States. “This
increasing flow of investment has seen a jump
in the total scientific output,” says Cong Cao,
a science-policy researcher at the University of 
Nottingham, UK. Last year, a study by Britain’s 
Royal Society found that, in 2004-08, China 
produced 10% of the world’s published scientific 
articles, putting it second after the United States. 

However, a study conducted by the World
Bank and China’s cabinet […]
produces relatively few high-impact articles,
and that the majority of Chinese patents con- 
stitute minor novelties rather than genuine 
innovations. These problems are “less about the
amount of money spent in science than how the
money is spent”, says Su Jun, a science-policy
researcher at Tsinghua University in Beijing. 


BACK TO BASICS 

Experts say that one factor holding back China's 
research quality is its modest spending on 
basic research. According to the annual report 
released last month by the National Bureau of 
Statistics of China, the country’s total public and 
private R&D expenditure last year was 861 bil- 
lion renminbi (1.83% of gross domestic prod- 
uct), of which only 39.6 billion renminbi (4.6%) 
was spent on basic research. This falls far behind 
the 15-25% share in most developed countries. 
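
The shares quoted in this section can be checked with a few lines of arithmetic. The sketch below uses only the rounded figures given in the article (in billions of renminbi); the variable names are illustrative, and the final figure, what basic research would receive at the low end of the developed-country range, is a hypothetical comparison rather than a reported number.

```python
# Rounded figures from the article, in billions of renminbi.
TOTAL_RD_2011 = 861.0        # total public and private R&D expenditure last year
BASIC_RD_2011 = 39.6         # portion of that spent on basic research
CENTRAL_BASIC_2012 = 32.5    # central-government basic-research budget this year
CENTRAL_BASIC_2011 = 29.5    # central-government basic-research spending in 2011
LOW_END_SHARE = 0.15         # low end of the 15-25% share cited for developed countries


def share(part: float, whole: float) -> float:
    """Fraction of `whole` accounted for by `part`."""
    return part / whole


def pct_change(new: float, old: float) -> float:
    """Relative change from `old` to `new`, as a fraction."""
    return (new - old) / old


basic_share = share(BASIC_RD_2011, TOTAL_RD_2011)
central_growth = pct_change(CENTRAL_BASIC_2012, CENTRAL_BASIC_2011)
# Hypothetical comparison: basic-research spending at a 15% share of the same total.
needed_at_low_end = LOW_END_SHARE * TOTAL_RD_2011

print(f"Basic-research share of total R&D: {basic_share:.1%}")
print(f"Growth in central basic-research budget: {central_growth:.1%}")
print(f"Basic research at a 15% share: {needed_at_low_end:.1f} billion renminbi")
```

The computed share matches the 4.6% stated in the text. The growth figure from the rounded inputs comes out at about 10.2%, marginally above the quoted 10.1%, which is consistent with rounding in the published numbers.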

“Weak capacity in basic science has become 
the main constraining factor for most R&D 
in China, including drug discovery,” says Liu
Gang, a pharmacologist at Tsinghua Univer- 
sity. In 2006, China unveiled an ambitious plan 
to invigorate its drug industry, with funding of 
about 20 billion renminbi over 15 years. “I’m not
optimistic that we will be able to get any blockbuster
drug out of that programme,” says Liu.

Some researchers also criticize the large 
share of funding typically given to govern- 
ment-led initiatives. “Genuine scientific 
breakthroughs cannot rely on such top-down 
mega-projects,” says Mu-Ming Poo, director
of the Chinese Academy of Sciences’ Insti- 
tute of Neuroscience in Shanghai. The United 
States, by contrast, funnels a large proportion 
of spending to investigator-initiated research. 
In China, such research is mostly sponsored 
by the National Natural Science Foundation, 
but its funding, which amounted to 12.4 billion 
renminbi last year, is only a small proportion of 
the government's science expenditure. 

Lack of fairness and transparency in funding 
decisions — with success often depending on 
personal connections — could also be hold- 
ing back Chinese science. “A huge amount of 
money is often controlled by a small number of 
people, and there are significant irregularities 
in their funding decisions,” says one Beijing-based
researcher. And in comparison with other
research expenditure, investment in human 
capital is limited, with many young scientists 
deterred by poor salaries, adds Su. “Most labs 
in China struggle to find good postdocs, the key 
workforce of research,” agrees Poo.

The science ministry is under growing pres- 
sure from the scientific community to reform 
the research funding and appraisal system to 
help address these problems. Many research- 
ers contacted by Nature hope that China will 
make the changes a top priority after its new 
government takes office later this year. “The 
extent to which the reform can be instigated 
and successful will determine the future of 
China’s technological and economic competitiveness,”
says Cao.
Japan fails to settle
university dispute 


Investigations highlight need for a national, independent 
body to oversee research ethics. 


BY DAVID CYRANOSKI IN TOKYO 


scientist Akihisa Inoue, the president of 
Tohoku University in Japan. 

Last March, an earthquake crippled his 
campus (see Nature 483, 141-143; 2012). 
Since then, he has had to retract a series of 
papers because they contained text that had 
appeared in his previous publications, and has 
faced continuing calls for his resignation from 
the university, which he has rejected. His crit- 
ics, mostly professors at his university, claim 
that some of his work cannot be replicated, and 
that there are irregularities in the data in some 
of his papers (see Nature 470, 446-447; 2011). 

Inoue denies any manipulation of data, and 
there is no evidence that he has committed 
any scientific misconduct. Indeed, with more 
than 2,500 publications to his name, Inoue is 
one of the world’s leading experts in metal- 
lic glasses, materials that are more elastic and 
more resistant to corrosion than metals. He 
has previously told Nature that other research- 
ers may simply lack the skills and experience 
to reproduce some of his lab’s results. 

Yet Inoue’s battle with his detractors is far 
from over. Since January, two inquiries into 
his research have reported their conclusions. 
One offered a rebuke for the duplications; the 
other recommended a further investigation, 
giving his critics renewed vigour. The row 
raises questions about how universities in 
Japan should investigate allegations against 
their most senior staff, given that the country 
has no external body with this responsibility. 

In December 2007, a committee from 
Tohoku University dismissed the need for 
an official investigation into Inoue, on the 
basis that there was no case against him. But 
Inoue’s critics were not appeased, because, 
they argued, the majority of the committee had 
been appointed to their administrative posts 
(but not to the committee itself) by Inoue. The 
committee denied any conflict of interest, add- 
ing that it was able to judge the case fairly. 

Since then, materials scientist Fumio Saito 
at Tohoku has pointed out that the text in 
seven of Inoue’s papers substantially dupli- 
cated work previously published by Inoue’s lab. 
These seven papers have since been retracted. 
Inoue told Nature that the duplications were 
accidents, or the result of miscommunication
with co-authors. 

Under pressure to take a closer look at Inoue’s 
work, Tohoku’s directors assembled another 
committee in February 2011. On 24 January,
that committee’s final report concluded that
Inoue was at fault for the duplications, which
should not happen again, but that the reuse
of text is, to some degree, accepted practice in
materials science, particularly in papers that
draw on conference proceedings.

The Japan Science and Technology Agency 
(JST), a government funding body that gave 
Inoue ¥2.1 billion (US$26 million at the cur- 
rent exchange rate) to support his research 
from 1997 to 2007, also launched an inves- 
tigation. It commissioned Makoto Misono, 
head of the Japan Union of Chemical Science 
and Technology, to lead a committee to inves- 
tigate whether the duplications and alleged 
data irregularities called any of Inoue’s main 
results into question. Its report, released on 
17 February, concluded that they had not. 

The committee did not investigate any 
other allegations, but suggested that to settle 
the matter once and for all, an independent 
body should conduct an inquiry. The univer- 
sity says it is already conducting the investiga- 
tion requested by the JST, but has declined to 
provide any details about its remit. Inoue has 
declined to comment on Misono’s report, or
on its call for an independent investigation. 

The broader message of the affair, Misono 
says, is that Japan should give more respon- 
sibility for the oversight of research conduct 
to an independent body such as the Science 
Council of Japan, an advisory body to the 
government. In general, Misono says, “when 
evaluating claims of scientific misconduct, it is 
necessary to have objective and fair investigation.
If a research institution cannot do that,
it’s necessary to have an independent third
party.” SEE EDITORIAL, P.246


CORRECTION 

The News story ‘Trouble at the text mine’ 
(Nature 483, 134-135; 2012) wrongly 
located BioNOT at the University of 
Wisconsin-Madison instead of the University 
of Wisconsin-Milwaukee. 
A TALE OF TWO HALVES


Since the 1960s, researchers have been scrutinizing a handful
of patients who underwent a radical kind of brain surgery.
The cohort has been a boon to neuroscience — but soon it will be gone.


BY DAVID WOLMAN 


In the first months after her surgery, shopping for groceries was
infuriating. Standing in the supermarket aisle, Vicki would look
at an item on the shelf and know that she wanted to place it in her
trolley — but she couldn’t. “I’d reach with my right for the thing
I wanted, but the left would come in and they’d kind of fight,” she says.
“Almost like repelling magnets.” Picking out food for the week was a
two-, sometimes three-hour ordeal. Getting dressed posed a similar
challenge: Vicki couldn’t reconcile what she wanted to put on with
what her hands were doing. Sometimes she ended up wearing three
outfits at once. “I’d have to dump all the clothes on the bed, catch my
breath and start again.”

In one crucial way, however, Vicki was better than her pre-surgery 
self. She was no longer racked by epileptic seizures that were so severe 
they had made her life close to unbearable. She once collapsed onto the 
bar of an old-fashioned oven, burning and scarring her back. “I really
just couldn’t function,” she says. When, in 1978, her neurologist told
her about a radical but dangerous surgery that might help, she barely 
hesitated. If the worst were to happen, she knew that her parents would 
take care of her young daughter. “But of course I worried,” she says. 
“When you get your brain split, it doesn’t grow back together.”

In June 1979, in a procedure that lasted nearly 10 hours, doctors cre- 
ated a firebreak to contain Vicki’s seizures by slicing through her cor- 
pus callosum, the bundle of neuronal fibres connecting the two sides of 
her brain. This drastic procedure, called a corpus callosotomy, discon- 
nects the two sides of the neocortex, the home of language, conscious 
thought and movement control. Vicki’s supermarket predicament was 
the consequence of a brain that behaved in some ways as if it were two
separate minds. 

After about a year, Vicki’s difficulties abated. “I could get things 
together,” she says. For the most part she was herself: slicing vegetables,
tying her shoe laces, playing cards, even waterskiing. 

But what Vicki could never have known was that her surgery would 
turn her into an accidental superstar of neuroscience. She is one of 
fewer than a dozen ‘split-brain’ patients, whose brains and behaviours
have been subject to countless hours of experiments, hundreds of sci- 
entific papers, and references in just about every psychology textbook 
of the past generation. And now their numbers are dwindling. 

Through studies of this group, neuroscientists now know that the 
healthy brain can look like two markedly different machines, cabled 
together and exchanging a torrent of data. But when the primary cable 
is severed, information — a word, an object, a picture — presented to 
one hemisphere goes unnoticed in the other. Michael Gazzaniga, a 
cognitive neuroscientist at the University of California, Santa Barbara, 
and the godfather of modern split-brain science, says that even after 
working with these patients for five decades, he still finds it thrilling 
to observe the disconnection effects first-hand. “You see a split-brain 
patient just doing a standard thing — you show 
him an image and he can't say what it is. But he 
can pull that same object out of a grab-bag,” Gazzaniga
says. “Your heart just races!”

Work with the patients has teased out
differences between the two hemispheres,
revealing, for instance, that the left side usually leads the way for
speech and language computation, and the right specializes in vis- 
ual-spatial processing and facial recognition. “The split work really 
showed that the two hemispheres are both very competent at most 
things, but provide us with two different snapshots of the world,” says
Richard Ivry, director of the Institute of Cognitive and Brain Sciences 
at the University of California, Berkeley. The idea of dichotomous 
consciousness captivated the public, and was greatly exaggerated in 
the notion of the ‘creative right brain’. But further testing with split- 
brain patients gave a more-nuanced picture. The brain isn’t like a com- 
puter, with specific sections of hardware charged with specific tasks. 
It's more like a network of computers connected by very big, busy 
broadband cables. The connectivity between active brain regions is 
turning out to be just as important, if not more so, than the operation 
of the distinct parts. “With split-brain patients, you can see the impact 
of disconnecting a huge portion of that network, but without damage 
to any particular modules,” says Michael Miller, a psychologist at the
University of California, Santa Barbara. 

David Roberts, head of neurosurgery at Dartmouth-Hitchcock 
Medical Center in Lebanon, New Hampshire, sees an important lesson 
in split-brain research. He operated on some of the cohort members, 
and has worked closely with Gazzaniga. “In medical school, and science 
in general, there is so much emphasis on large numbers, labs, diagnostics
and statistical significance,” Roberts says — all crucial when, say,
evaluating a new drug. But the split-brain cohort brought home to him 
how much can be gleaned from a single case. “I came to learn that one 
individual, studied well, and thoughtfully, might enable you to draw 
conclusions that apply to the entire human species,” he says. 

Today, the split-brain patients are getting on in years; a few have 
died, one has had a stroke and age in general has made them all less 
fit for what can be taxing research sessions of sitting, staring and con- 
centrating. The surgery, already quite rare, has been replaced by drug 
treatments and less drastic surgical procedures. Meanwhile, imaging 
technologies have become the preferred way to look at brain func- 
tion, as scientists can simply watch which areas of the brain are active 
during a task. 

But to Miller, Ivry, Gazzaniga and others, split-brain patients remain 
an invaluable resource. Imaging tools can confirm, for example, that 
the left hemisphere is more active than the right when processing lan- 
guage. But this is dramatically embodied in a split-brain patient, who 
may not be able to read aloud a word such as ‘pan’ when it’s presented
to the right hemisphere, but can point to the appropriate drawing. 
“That gives you a sense of the right hemisphere’s ability to read, even 
if it can’t access the motor system to produce speech,” Ivry says. “Imaging
is very good for telling you where something happens,” he adds,
“whereas patient work can tell you how something happens.” 


A CABLE, CUT
Severing the corpus callosum was first used as a treatment for severe 
epilepsy in the 1940s, on a group of 26 people in Rochester, New York. 
The aim was to limit the electrical storm of the seizure to one side 
of the brain. At first, it didn’t seem to work. But in 1962, one patient 
showed significant improvement. Although the procedure never 
became a favoured treatment strategy — it’s invasive, risky, and drugs 
can ease symptoms in many people — in the decades since it neverthe- 
less became a technique of last resort for treating intractable epilepsy. 
To Roger Sperry, then a neurobiologist and neuropsychologist at the 
California Institute of Technology, and Gazzaniga, a graduate student 
in Sperry’s lab, split-brain patients presented a unique opportunity to 
explore the lateralized nature of the human brain. At the time, opinion 
on the matter was itself divided. Researchers who studied the first 
split-brain patients in the 1940s had concluded that the separation 
didn’t noticeably affect thought or behaviour. (Gazzaniga and others 
suspect that these early sections were incomplete, which might also 
explain why they didn't help the seizures.) Conversely, studies con- 
ducted by Sperry and colleagues in the 1950s revealed greatly altered 
brain function in animals that had undergone callosal sections. Sperry
and Gazzaniga became obsessed with this inconsistency, and saw in 
the split-brain patients a way to find answers. 

The duo’s first patient was a man known as W.J., a former Second
World War paratrooper who had started having seizures after a German
soldier clocked him in the head with the butt of a rifle. In 1962,
after W.J.’s operation, Gazzaniga ran an experiment in which he asked
W.J. to press a button whenever he saw an image. Researchers would
then flash images of letters, light bursts and other stimuli to his left 
or right field of view. Because the left field of view is processed by the 
right hemisphere and vice versa, flashing images quickly to one side or 
the other delivers the information solely to the intended hemisphere 
(see ‘Of two minds’). 

For stimuli delivered to the left hemisphere, W.J. showed no hang- 
ups; he simply pressed the button and told the scientists what he saw. 
With the right hemisphere, W.J. said he saw nothing, yet his left hand 
kept pressing the button every time an image appeared. “The left and 
right didn’t know what the other was doing,” says Gazzaniga. It was a 
paradigm-blasting discovery showing that the brain is more divided 
than anyone had predicted.

Suddenly, the race was on to delve into the world of lateralized 
function. But finding more patients to study proved difficult. Gazzaniga 
estimates that at least 100 patients, and possibly many more, received a 
corpus callosotomy. But individuals considered for the operation tend 
to have other significant developmental or cognitive problems; only 
a few have super-clean cuts and are neurologically healthy enough to 
be useful to researchers. For a while, Sperry, Gazzaniga and their colleagues
didn’t know if there was ever going to be anyone else like W.J.

But after contacting neurosurgeons, partnering with epilepsy cen- 
tres and assessing many potential patients, they were able to identify a 
few suitable people in California, then a cluster from the eastern part 
of the United States, including Vicki. Through the 1970s and the early
1980s, split-brain research expanded, and neuroscientists became
particularly interested in the capabilities of the right hemisphere —
the one conventionally believed to be incapable of processing language
and producing speech.

The callosum tissue seen in a healthy brain (bright white in top image)
retracts after a corpus callosotomy, leaving just the ventricle (black).


OF TWO MINDS

Split-brain patients have undergone surgery to cut
the corpus callosum, the main bundle of neuronal
fibres connecting the two sides of the brain.
Experiments with split-brain patients have helped to
illuminate the lateralized nature of brain function.

Input from the left field of view is processed by the right
hemisphere and vice versa.

A word is flashed briefly to the
right field of view, and the patient
is asked what he saw.

Now a word is flashed to the left
field of view, and the patient is
asked what he saw.

Gazzaniga can tick through the names of his “endlessly patient 
patients” with the ease of a proud grandparent doing a roll call of 
grandchildren — W.J., A.A., R.Y., L.B., N.G. For medical confidentiality,
they are known in the literature by initials only. (Vicki agreed to be
identified in this article, provided that her last name and hometown 
were not published.) 

On stage last May, delivering a keynote address at the Society of 
Neurological Surgeons’ annual meeting in Portland, Oregon, Gaz- 
zaniga showed a few grainy film clips from a 1976 experiment with 
patient P.S., who was only 13 or 14 at the time. The scientists wanted 
to see his response if only his right hemisphere saw written words. 

In Gazzaniga’s video, the boy is asked: who is your favourite girlfriend,
with the word ‘girlfriend’ flashed only to the right hemisphere.
As predicted, the boy can’t respond verbally. He shrugs and shakes 
his head, indicating that he doesn’t see any word, as had been the case 
with W.J. But then he giggles. It’s one of those tell-tale teen giggles —
a soundtrack to a blush. His right hemisphere has seen the message, 
but the verbal left-hemisphere remains unaware. Then, using his left 
hand, the boy slowly selects three Scrabble tiles from the assortment in 
front of him. He lines them up to spell L-I-Z: the name, we can safely 
assume, of the cute girl in his class. “That told us that he was capable 
of language comprehension in the right hemisphere,” Gazzaniga later
told me. “He was one of the first confirmation cases that you could
get bilateral language — he could answer queries using language from
either side.”

The implications of these early observations were “huge”, says
Miller. They showed that “the right hemisphere is experiencing its 
own aspect of the world that it can no longer express, except through 
gestures and control of the left hand”. A few years later, the researchers 
found that Vicki also had a right-hemisphere capacity for speech. Full
callosotomy, it turned out, resulted in some universal disconnections,
but also affected individuals very differently.

OF TWO MINDS (continued)

Because the left hemisphere is dominant
for verbal processing, the patient’s
answer matches the word.

The right hemisphere cannot share information
with the left, so the patient is unable to say
what he saw, but he can draw it.

In 1981, Sperry was awarded a share of the Nobel Prize in Physi- 
ology or Medicine for the split-brain discoveries. (“He deserved it,” 
Gazzaniga says.) Sperry died in 1994, but by that point, Gazzaniga 
was leading the charge. By the turn of the century, he and other split- 
brain investigators had turned their attention to another mystery: 
despite the dramatic effects of callosotomy, W.J. and later patients 
never reported feeling anything less than whole. As Gazzaniga wrote 
many times: the hemispheres didn’t miss each other. 

Gazzaniga developed what he calls the interpreter theory to explain 
why people — including split-brain patients — have a unified sense of 
self and mental life. It grew out of tasks in which he asked a split-brain
person to explain in words, which uses the left hemisphere, an action 
that had been directed to and carried out only by the right one. “The 
left hemisphere made up a post hoc answer that fit the situation.” In 
one of Gazzaniga’s favourite examples, he flashed the word ‘smile’ to 
a patient’s right hemisphere and the word ‘face’ to the left hemisphere, 
and asked the patient to draw what he’d seen. “His right hand drew a
smiling face,” Gazzaniga recalled. “‘Why did you do that?’ I asked. He
said, ‘What do you want, a sad face? Who wants a sad face around?’”
The left-brain interpreter, Gazzaniga says, is what everyone uses to 
seek explanations for events, triage the barrage of incoming informa- 
tion and construct narratives that help to make sense of the world. 

The split-brain studies constitute “an incredible body of work”, said
Robert Breeze, a neurosurgeon at the University of Colorado Hospital 
in Aurora, after listening to Gazzaniga’s lecture last year. But Breeze, 
like many other neuroscientists, sees split-brain research as outdated. 
“Now we have technologies that enable us to see these things” — tools 
such as functional magnetic resonance imaging (fMRI) that show the 
whereabouts of brain function in great detail. 

Miller, however, disagrees. “These kinds of patients can tell us 
things that fMRI can never tell us,” he says.


SUBJECT OF INTEREST 

Seated at a small, oval dining-room table, Vicki faces a laptop propped 
up on a stand, and a console with a few large red and green buttons.
David Turk, a psychologist at the University of Aberdeen, UK, has 
flown in for the week to run a series of experiments. 

Vicki’s grey-white hair is pulled back in a ponytail. She wears simple 
white sneakers and, despite the autumn chill, shorts. She doesn't want 
to get too warm: when that happens she can get drowsy and lose focus, 
which can wreck a whole day of research. 
During a break, Vicki fetches an old photo album. In one picture,
taken soon after her surgery, she is sitting up in the hospital bed. Her 
hair is starting to grow back as black stubble and she and her daugh- 
ter have wide smiles. Another page of the album has a slightly faded 
printout of a 1981 paper from The Journal of Neuroscience glued into it: 
the first published report involving data gleaned from Vicki, in which 
researchers describe how she, like P.S., had some capacity for language 
in her right hemisphere.

When pressed to share the most difficult aspect of her life in science, 
the perpetually upbeat Vicki says that it would have to be an apparatus 
called the dual Purkinje eye tracker. This medieval-looking device 
requires the wearer to bite down on a bar to help keep the head still 
so that researchers can present an image to just the left or right field 
of view. It is quite possible that Vicki has spent more of her waking 
hours biting down on one of those bars than anyone else on the planet. 

Soon, it is time to get back to work. Turk uses some two-sided tape 
to affix a pair of three-dimensional glasses onto the front of Vicki’s 
thin, gold-rimmed bifocals. The experiment he is running aims to 
separate the role of the corpus callosum in visual processing from that 
of deeper, ‘subcortical’ connections unaffected by the callosotomy. 
Focusing on the centre of the screen, Vicki is told to watch as the 
picture slowly switches between a house and different faces — and to 
press the button every time she sees the image change. Adjusting her 
seat, she looks down the bridge of her nose at the screen and tells Turk 
that she’s ready to begin. 


DEEP CONNECTIONS 

Other researchers are studying the role of subcortical communica- 
tion in the coordinated movements of the hands. Split-brain patients 
have little difficulty with ‘bimanual’ tasks, and Vicki and at least one 
other patient are able to drive a car. In 2000, a team led by Liz Franz 
at the University of Otago in New Zealand asked split-brain patients
to carry out both familiar and new bimanual tasks. A patient who
was an experienced fisherman, they found, could pantomime tying a
fishing line, but not the unfamiliar task of threading a needle. Franz
concluded that well-practised bimanual skills are coordinated at the
subcortical level, so split-brain people are able to smoothly choreograph
both hands.

Michael Gazzaniga has worked with split-brain patients for 50 years.

Miller and Gazzaniga have also started to study the right hemi- 
sphere’s role in moral reasoning. It is the kind of higher-level func- 
tion for which the left hemisphere was assumed to be king. But in the 
past few years, imaging studies have shown that the right hemisphere 
is heavily involved in the processing of others’ emotions, intentions 
and beliefs — what many scientists have come to understand as the 
‘theory of mind’. To Miller, the field of enquiry perfectly illustrates
the value of split-brain studies because answers can't be found by way 
of imaging tools alone. 

In work that began in 2009, the researchers presented two split- 
brain patients with a series of stories, each of which involved either 
accidental or intentional harm. The aim was to find out whether the 
patients felt that someone who intends to poison his boss but fails 
because he mistakes sugar for rat poison, is on equal moral ground 
with someone who accidentally kills his boss by mistaking rat poison 
for sugar. (Most people conclude that the former is more morally
reprehensible.) The researchers read the stories aloud, which meant 
that the input was directed to the left hemisphere, and asked for ver- 
bal responses, so that the left hemisphere, guided by the interpreter 
mechanism, would also create and deliver the response. So could the 
split-brain patients make a conventional moral judgement using just 
that side of the brain? 

No. The patients reasoned that both scenarios were morally equal. 
The results suggest that both sides of the cortex are necessary for this 
type of reasoning task. 

But this finding presents an additional puzzle, because relatives 
and friends of split-brain patients do not notice unusual reasoning or 
theory-of-mind deficits. Miller’s team speculates that, in everyday life, 
other reasoning mechanisms may compensate for disconnection effects 
that are exposed in the lab. It’s an idea that he plans to test in the future. 

As the opportunities for split-brain research dwindle, Gazzaniga is 
busy trying to digitize the archive of recordings of tests with cohort 
members, some of which date back more than 50 years. “Each scene 
is so easy to remember for me, and so moving,” he says. “We were 
observing so many astonishing things, and others should have the 
same opportunity through these videos.” Perhaps, he says, other 
researchers will even uncover something new. 

Other split-brain patients may become available — there is a small 
cluster in Italy, for instance. But with competition from imaging 
research and many of the biggest discoveries about the split brain 
behind him, Gazzaniga admits that the glory days of this field of science
are probably gone. “It is winding down in terms of patients commonly
tested.” Still, he adds: “I have a hard time saying it’s all over.”

And maybe it’s not — as long as there are scientists pushing to tackle 
new questions about lateralized brain function, connectivity and com- 
munication, and as long as Vicki and her fellow cohort members are 
still around and still willing participants in science. Her involvement 
over the years, Vicki says, was never really about her. “It was always 
about getting information from me that might help others.”


David Wolman is a freelance writer based in Portland, Oregon, and 
the author, most recently, of The End of Money. 
Make it a decaf


The enduring quest for a coffee bean without the buzz. 


BY BRENDAN BORRELL 


Paulo Mazzafera punched a pea-sized disc out of a waxy green
coffee leaf, then placed the disc in a small vial with a mixture 
of chloroform and methanol to dissolve it. Later, he loaded 
the extract, along with 95 other samples, into a high-per- 
formance liquid chromatography machine, which separates 
out each chemical component. When the plant physiologist 
returned to his lab at the University of Campinas in Brazil 
the next morning, he sat down at his laptop to examine the 
results. Scrolling from one chromatogram to the next, he scrutinized the peak 
representing caffeine. In one plant, it was missing. 

Mazzafera ran the sample twice more and then, just before noon, called his 
collaborator Bernadete Silvarolla, based at the agricultural station nearby, to share 
the news. “Are you sure?” she asked. He was. In fact, he was thrilled. After screening 
thousands of plants over the course of two decades, his project to find a naturally 
caffeine-free coffee finally seemed to be bearing fruit. That was in late 2003. 

Coffee contains some 2,000 chemical compounds that give the drink its enticing 
aroma and flavour, including caffeine, a stimulant and natural pesticide. Removing 
the caffeine while leaving all the others intact poses a significant challenge. Brewers 
have generally turned to chemistry: Ludwig Roselius of Bremen, Germany, patented 
the first commercial decaffeination process in 1905. But his coffee, marketed as 
Kaffee HAG, used benzene in the extraction process, and the chemical was later 
replaced by less toxic solvents. Today, companies may instead douse raw green 
coffee beans in high-pressure liquid carbon dioxide or soak them in hot water for 
several hours to remove the caffeine before roasting. Aficionados say that all these 
methods destroy the taste, but the decaf market is still worth US$2 billion a year.

Researchers have long sought a better bean, harvested directly from the plant
caffeine-free. This would preserve coffee’s complex flavour and give growers a
high-end slice of the decaf market. But developing such a bean through conventional
breeding or even genetic modification has proved more difficult than anyone
anticipated (see ‘The ups and downs of decaffeination’). Coffee plants
take years to begin producing beans, and can be fickle when they do. 
Moreover, to make them profitable to farm, the plants need to be pro- 
ductive, ripen synchronously and be of a size and shape that can be 
harvested easily by hand or by machines. The loss of any of these traits 
can render a plant worthless. The caffeine-free quest has produced 
a string of high-profile papers, but not a drop of marketable coffee. 

Basic research often proves hard to translate into industrial agricul- 
ture, says Rod Sharp, a retired plant cell biologist who tried to produce 
caffeine-free coffee in the 1980s while at DNA Plant Technology in 
Cinnaminson, New Jersey. “Those in the research lab are always a bit 
naive. We jump up and down when there is a breakthrough, but turning
that into a commercial operation is another challenge altogether.”
However, Sharp remains confident that the plant will one day yield its
secrets. “It will happen,” he says. “It’s just a long incubation period.”
This is certainly true for Mazzafera’s plant: more than eight years after 
its discovery, his colleagues are still trying to turn it into a crop. 

Coffee is worth a total of $15 billion to $20 billion per year to export- 
ing countries, which include Brazil, Colombia and Vietnam. A rela- 
tively recent innovation, compared with tea or wine, it dates to about 
the fifteenth century when — according to at least one account — a 
Yemeni mystic described a revitalizing beverage prepared in Ethiopia 
by roasting and boiling berries. Two coffee species dominate the market 
today: Coffea arabica, the better-tasting bean, which grows in cooler 
climates, and Coffea canephora, commonly known as ‘robusta’, which is
used mainly in instant coffee and lower-quality blends. Caffeine, which 
is present at 1.2% in commercial C. arabica and 2-3% in C. canephora, 
has helped to make both species part of a worldwide addiction. 

But the stimulant beloved by many is avoided by others who are 
unusually sensitive to its effects, abstain for religious reasons or simply 
don’t want to be kept awake. Gabriel Bertrand of the Pasteur Institute 
in Paris discovered a caffeine-free species of coffee on Grande Comore 
island near Madagascar in 1901. In fact, many of the hundred or so 
species of Coffea are either caffeine-free or contain low levels of the 
stimulant. Several naturally occurring intermediate-caffeine coffees 
(with 0.6-1% caffeine) have already reached the market, including 
one produced by the Italian coffee-maker Illy. Unfortunately, most 
plants with the lowest caffeine levels produce few 
beans, or contain high concentrations of bitter 
compounds. Nevertheless, the existence of such natural variation suggests
that a diligent breeder could create a commercially viable caffeine-free
strain.

Aficionados say that chemical extraction destroys the taste, but the decaf
market is still worth US$2 billion a year.

COFFEE AND DREAMS

Mazzafera began his attempts to do just that in 1983 at the Agronomical
Institute of Campinas (IAC), a century-old agricultural station in the
rolling hills northwest of São Paulo, Brazil. He set out to study the
genetics and physiology of caffeine biosynthesis under Alcides Carvalho, a
pioneering plant breeder who established the IAC’s collection of coffee
plants, which now number 70,000 and represent more than 1,000 wild strains,
breeding lines, hybrids, mutants and cultivated varieties from around the
world.

At first, Mazzafera used an old-school spectrophotometer to measure
caffeine content one sample at a time. In 1987, he got a job at the
University of Campinas and installed a high-performance liquid
chromatography machine in his lab, allowing him to process samples more
efficiently. By this point, scientists had sketched out the basic four-step
pathway by which C. arabica synthesizes caffeine. Mazzafera studied
caffeine production and breakdown in great detail in seven species, hoping
to find one with defects in a pathway that rendered the plant low in
caffeine. At the same time, he and Carvalho, who died in 1993, were
cross-breeding commercial coffee cultivars with wild non-arabica species
low in caffeine. But it proved impossible to eliminate caffeine while
maintaining the desirable attributes of C. arabica. “We were just wasting
time,” says Mazzafera.

In 2000, Mazzafera teamed up with Silvarolla, a coffee breeder at the IAC.
They shifted their focus to a group of C. arabica plants originally
collected during a 1964 United Nations expedition to Eritrea and Ethiopia.
Seed samples — 620 in total — were divided up and grown in several
countries, including Costa Rica. Later, 308 of these lineages were
collected in Costa Rica and sent to Brazil. Mazzafera believed it would be
much easier to produce marketable coffee by starting with the Ethiopian
C. arabica plants than by hybridizing with other species.

It was from this collection that Mazzafera discovered the promising strain
in 2003, as well as two more like it. After confirming that, like the
leaves, the beans were caffeine-free, he worked out that the plants were
defective in the final step of the chemical pathway that turns theobromine
— a mild stimulant and diuretic — into caffeine. The Brazilian government
offered the research group a $1.2-million grant along with an order to keep
the location of the precious plants under wraps. Mazzafera felt certain
that commercial growers would be planting the new variety in five years.
That is, so long as others didn’t get there first.

BEAN MODIFICATION

The advent of genetic engineering led many scientists to try to make decaf
by splicing the right genes into the right beans. But coffee has proved
resistant to this kind of tinkering. In 1992, geneticist John Stiles of the
University of Hawaii in Honolulu wanted to use ‘antisense’ technology,
whereby a gene inserted into the plant reduces production of a target
protein. This was the technology used to produce the Flavr Savr tomato, the
first genetically modified organism to be approved for human consumption.
Stiles’s goal was to target a protein in the caffeine-producing pathway.

Problems cropped up almost immediately. Creating any genetically modified
plant involves culturing a blob of plant cells on nutrient-rich agar,
inserting the desired genetic material into those cells, and then
convincing the tissue to sprout into a plant. For C. arabica, that process
is mysteriously inefficient.

Over the next seven years, Stiles and two postdocs, Kabi Neupane and Stefan
Moisyadi, struggled to overcome the biological roadblocks. They produced
plants that seemed to have low levels of caffeine, and Stiles made some
enthusiastic claims. In August 1999, for example, he told The Wall Street
Journal that he would be beginning field trials that month in Hawaii before
expanding to Mexico, with commercial prospects after three years.

The plants, however, did not cooperate. As they grew, their caffeine levels
rose. Moisyadi and Neupane moved on to academic careers and, in 2000,
Stiles left the university, setting up a private lab in Waialua. In 2008,
after disagreements with the local community and the state legislature over
the right to field test his transgenic coffee, the company folded, and
Stiles now says that he was never 100% certain that he had created
caffeine-free coffee. Many coffee researchers also doubt that he had.
“We were always doing this on a shoestring,” Stiles says. “We were
never Monsanto.”

Neither was Shinjiro Ogita, a postdoc in Hiroshi Sano’s lab at the
Nara Institute of Science and Technology in Japan. In 2001, he began 
a research programme targeting an enzyme in the caffeine pathway 
that had recently been identified in tea. His group used an efficient 
gene-silencing technology called RNA interference, and worked with 
robusta coffee in the hope that cell culture would be easier. It wasn't. 
Few cells took up the engineered DNA. Ogita was able to recover just 
enough to produce 35 transgenic seedlings. 

He tested some leaves and found they had as much as 70% less caffeine than
his control plants. “It was kind of unbelievable,” he says. He remembers
popping a bottle of Dom Perignon champagne the day his paper was accepted
by Nature. He has since tailored the methods to C. arabica, but the plants
have still not produced beans. Now at Toyama Prefectural University, Ogita
tends about 40 transgenic plants. Each year, he says, the female part of
the flowers, called the pistil, matures and deteriorates a week before the
pollen is ready.

15 MARCH 2012 | VOL 483 | NATURE | 265

© 2012 Macmillan Publishers Limited. All rights reserved

The ups and downs of decaffeination

For more than a century, brewers and growers have looked for ways to create
tasty, caffeine-free coffee.

Chemistry
+ Several well-established chemical methods strip beans of their caffeine,
  which can be sold for use in drugs and cosmetics.
− Extraction processes can be very expensive. Coffee aficionados say that
  the flavour is destroyed.

Hybridization
+ Caffeine-free species of Coffea might be crossed with better-tasting
  Coffea arabica.
− Maintaining all desired traits is complex. One potential low-caffeine
  (~0.37%) strain is a hybrid of three species.

Breeding
+ Maintaining the desired traits of Coffea arabica might be easier if it
  were bred with a caffeine-free strain of the same species.
− Caffeine-free strains are hard to find. Some undesirable traits may still
  be difficult to ‘breed out’.

Genetic engineering
+ Can specifically target caffeine-biosynthesis genes.
− Coffee species are notoriously difficult to engineer. Public acceptance
  of genetically modified foods is low.

Mutagenization
+ A well-accepted method for creating variation in crop traits.
− Resultant specimens need to be screened, which can be a long and arduous
  process. Non-targeted traits may be affected.

Even if Ogita can overcome these breeding issues, it is hard to imagine
that transgenic caffeine-free coffee will land on supermarket shelves any
time soon. One basic hurdle, says biochemist Alan Crozier at the University
of Glasgow, UK, is that side routes in the caffeine pathway result in some
production of caffeine, so transgenic coffee may never be caffeine-free.
Gaining public acceptance is likely to be another challenge. Monsanto and
other agricultural giants have generally commercialized technologies with
the grower in mind — for example, pest resistance or herbicide tolerance —
rather than focus on consumer-oriented traits such as low caffeine or high
antioxidants. So finding financial backing may be tricky.

The biggest challenge of all may be that caffeine is there for a reason.

BREED APPEAL

A general aversion to genetically modified food has made the naturally
caffeine-free Ethiopian strains that much more appealing. But breeding the
trait into a commercially viable cultivar has taken longer than Mazzafera
or Silvarolla anticipated. During the flowering season, Silvarolla spends
all day in the field, clipping the pollen-producing stamens off the flowers
and bagging the pistils so that she can hand-pollinate them later. She
produces 800 new plants every year.

The beans that are produced taste good, according to tasting panels put
together by the IAC, but the plants tend to be bushy and don't flower
uniformly. The team is now working on breeding plants that have the
low-caffeine trait but not the low-productivity one. “At first, we thought
it would be a piece of cake,” says Miriam Maluf, a geneticist at the IAC
working on the project. But the researchers may be facing the biggest
challenge of all in removing caffeine from living plants: it is there for a
reason. Caffeine is a natural insecticide, which explains why wild coffee
plants that lack caffeine tend to contain other bitter compounds — to deter
pests.

And the researchers have to contend with these issues largely without
Mazzafera’s help. Soon after the 2004 discovery and subsequent publicity,
the IAC took greater control of the programme, and Mazzafera, based at the
university, has only a limited role. “It’s disappointing,” he says.

Nevertheless, despite so many years without success, the quest for
caffeine-free coffee shows little sign of fading. Plant geneticist Benoit
Bertrand of the French Centre for Agricultural Research and Development in
Montpellier has been searching the centre’s collection for caffeine-free
plants. And Chifumi Nagai of the Hawaiian Agriculture Research Center in
Waipahu has been working in Madagascar with the Japanese coffee
manufacturer UCC Ueshima to develop a three-species hybrid that tastes
good, has a moderate yield and contains just 0.37% caffeine. Its success is
uncertain; Madagascar faces significant logistical challenges even growing
and harvesting normal C. arabica.

Even Mazzafera, now 51, hasn't given up. On a cloudy November day, he walks
out past two mesh shade houses behind the biology department in Campinas
and reveals several hundred hip-high coffee plants. Some have rows of green
coffee berries clustered on their branches. Many, he says, are nearly
caffeine-free. Miffed at his inability to continue his work with the
Ethiopian varieties, he came up with a new plan in 2006. He took the seeds
of a productive C. arabica variety, soaked them in chemicals that cause
mutations, and then screened the caffeine levels of 28,000 seedlings. “It
was a shot in the dark,” he says. He ended up with 7 plants that have only
2% of normal caffeine levels. He has already trademarked their name:
Decaffito.

Challenges remain — the strain is susceptible to cross pollination, which
can reinstate caffeine production in the beans — but he is determined to
breed a commercially viable strain. He has even been talking to one company
about investing in his research. But knowing the hurdles ahead, he is
willing to settle for less. “If I had a farm,” he says, “I would grow this
coffee for myself.”


Brendan Borrell is a freelance journalist in New York City. 
Plug the real brain drain


Martin Schwab and Anita Buchli suggest ways to jump-start the stalled 
development of therapies for neurological diseases. 


Recently, at the annual retreat of the
Zurich Neuroscience Center, we ran
into a former colleague who had 
often helped us to prepare for courses we 
were teaching. But he was not there to teach 
— he was participating in a demonstration 
as a patient. A stroke had left him paralysed 
on one side, wheelchair-bound and unable 
to speak. He had been looking forward to 
interacting with the students, but when he 
could not communicate with them, he broke 
into tears. 

After a difficult rehabilitation, he was 
able to resume some of his work, but he still 
cannot speak. His arm and leg will probably 
remain paralysed for the rest of his life. 

Our colleague was one of the 8.2 million
Europeans who experience stroke every
year¹. The brain is a source of many devastating
disorders — such as Alzheimer’s disease,
Parkinson's disease and amyotrophic lateral 
sclerosis — and injuries to the spinal cord 
or brain can lead to lifelong impairments. 
At present, disabling spinal-cord injuries 
affect roughly 350,000 people in Europe and 
250,000 in the United States. Traumatic brain 
injuries are about ten times more common. 
Treatments that could restore lost functions 
to people with such injuries would radically 
change their lives and decrease the burden 
to their families and social environment.
The economic interest to drug companies
and health insurers seems obvious. Yet
drug companies have withdrawn from
neuroscience, more so than from any 
other disease area. Last year, Novartis 
closed its preclinical neuroscience research 
facility in Basel, Switzerland. Pfizer, 
GlaxoSmithKline and AstraZeneca had 
already made similar moves. Merck and 
Sanofi are also cutting research on brain 
diseases. 

Until recently, industry funded nearly half 
the budget for research and drug development
for brain disorders². Its retreat has left
an abyssal hole. 

The reason for companies’ reluctance
to pursue drugs for neurological disorders
is fairly straightforward: their investments
haven’t paid off. In the past 10–15
years, dozens of clinical trials for stroke
neuroprotection — involving thousands
of patients — have failed.

To get drug development going again, we 
must tackle the problems that have stalled 
it in the past by building a culture of inter- 
disciplinary exchange to generate promising 
compounds and setting aside public funds to 
conduct small, well-designed clinical studies 
of those compounds. We realize that in 
such a tight funding situation, every field is 
asking for more. But given the extraordinary 
burdens neurological diseases cause, they 
must become more of a priority.


A NEW HOPE

Drug companies have pulled out of neuro- 
science just as our understanding of brain 
plasticity has exploded. The antiquated view 
of the central nervous system as a hard- 
wired supercomputer has been overturned; 
the brain and spinal cord now appear as 
dynamic and adaptable biological systems. 

Large injuries to the brain and spinal cord 
are not repaired spontaneously, causing life- 
long impairment. But scientists have recently 
developed experimental interventions that 
enhance nerve-fibre growth and regen- 
eration in animals with massive brain injury. 
In experiments with rats, mice and monkeys, 
researchers (in our laboratory and others) 
have induced regrowth of injured nerve fibres 
in the brain and spinal cord by suppressing 
growth inhibitors — enough for the treated 
animals to regain lost functions³,⁴.

We and our colleagues at Novartis are now 
conducting a clinical trial in which people 
with spinal-cord injuries receive an antibody 
that counteracts the neural growth inhibi- 
tor Nogo-A (also known as reticulon-4A). 
Other clinical trials
to enhance repair of the spinal cord and
brain are or will soon be under way. But
progress is slow — the biotechnology
company Geron, for example, recently
abandoned a promising phase I trial of stem cells in spinal-cord
injury to concentrate instead on cancer⁵.

Why have so many trials failed, and what 
should be done better? A drug may be effec- 
tive and still fail in a trial. One reason is that 
companies often look for the most broadly 
applicable drug — for example, ‘for all stroke 
patients’ — but disease conditions often differ 
among patients, resulting in huge variations 
in treatment responses. Another problem 
with past trials was that the often crude clin- 
ical endpoints missed small but meaningful 
treatment effects, such as improvements in 
hand, leg or bladder function. With novel 
approaches, we can do better. 

To reinvigorate the field and avoid repeating
past problems, more exchange should
be fostered between basic and clinical
scientists. When spinal-cord researchers
began organizing retreats and workshops
to bring together basic researchers and clinicians,
they saw first-hand how little each
side knows about how the other works. The
mutual lack of knowledge was huge; each
side had completely different language to
describe the same scenario.

COSTS AND RESEARCH FUNDING IN EUROPE

Brain disorders are costlier and more prevalent than cancer but got similar
research funding in 2005.

                                              Brain disorders                Cancer
Total costs for 2010 (ref. 1)                 €798 billion (US$1 trillion)   €150 billion–250 billion
  Direct costs†                               60%                            41%
  Indirect costs‡                             40%                            59%
Proportion of all health-care expenditures    24%                            6.3%
Percentage of 2004 disease burden⁷            35%                            16.7%
Total 2005 research funding²                  €4.1 billion                   €3.9 billion
  Industry contribution                       €3.3 billion                   €2.5 billion

†Expenses such as medication or doctor’s visits. ‡Productivity lost to time
off work or early retirement.

A WISER APPROACH

If researchers collaborate from the outset,
they are more likely to produce a drug that
works⁶. For instance, they could establish a
set of criteria to evaluate a particular therapy
in both animals and humans, so that what
seems to work in one is more likely to seem to
work in the other. Newer diagnostic tools will
enable scientists to identify which subgroups
might benefit most from a specific therapy.
In addition, clinicians are now standardizing
observations of functional improvement so
that they can spot subtle changes that would
have gone unnoticed in the past.

Neuroscience faculties and medical
centres must work together to establish
research consortia and networks that unite
basic and clinical scientists. On a smaller
scale, retreats with select groups of experts
from both sides are inexpensive and can
jump-start a field. Already, studies of spinal-cord
injury are more focused now that the
two sides are communicating — some basic
researchers have begun using clinical criteria
for functional improvement.

We can't just throw money and resources
at the problem — we must use them wisely.
Instead of investing billions in one drug,
let’s spread funding among smaller, proof-of-concept
trials for compounds with good
preclinical evidence. By focusing on well-selected
populations (with tens of patients,
not hundreds) and concentrating on a few
centres, such trials would cost a few million
euros rather than the €50 million (US$67
million) or more needed for one large
trial. If smaller trials can bring a promising
compound to an advanced stage, industry
may then be willing to take it to market.

And, because the pharmaceutical industry
isn’t ready to invest in early-stage research in
neurological diseases, we must turn to other
sources. Insurance companies spend up to 
€2 million for each patient with a spinal-cord 
injury — a drug that could lower a patient’s 
disability would save insurers huge amounts. 
In 2009, the top five US health insurers 
earned more than US$12 billion; investment 
of even a small percentage of these profits 
in research could result in a true win-win 
situation. 

It is in countries’ best interests to dedicate 
more public money to small trials of therapies 
for brain diseases. People with such disorders 
may spend decades of their lives disabled, 
which can have enormous effects on their 
lives and on those of the people around them. 

In 2011, a report commissioned by the 
European Brain Council found that, in terms 
of health-care costs and lost productivity, 
brain disorders are a greater socio-economic 
burden than cancer, cardiovascular diseases 
and diabetes combined¹. Yet in 2005, research
funding for cancer and neurological diseases 
was roughly equal (see ‘Costs and research 
funding in Europe’). More than half of that 
total comprised private funding; now that 
drug companies have shifted focus, cancer 
funding is likely to eclipse that of neuro- 
science. Funding agencies must revise their 
budgets to reflect the immediate and future 
needs of our society. SEE COMMENT P. 269


Martin E. Schwab and Anita D. Buchli 
are at the Brain Research Institute of the 
University of Zurich and the Department 
of Health Sciences and Technology at the 
Federal Institute of Technology, CH-8057 
Zurich, Switzerland. 

e-mail: schwab@hifo.uzh.ch 


1. Gustavsson, A. et al. Eur. Neuropsychopharmacol. 21, 718–779 (2011).
2. Sobocki, P., Lekander, I., Berwick, S., Olesen, J. & Jonsson, B. Eur. J. Neurosci. 24, 2691–2693 (2006).
3. Zoerner, B. & Schwab, M. E. Ann. NY Acad. Sci. 1198, E22–E34 (2010).
4. Liu, K., Tedeschi, A., Park, K. K. & He, Z. Annu. Rev. Neurosci. 34, 131–152 (2011).
5. Baker, M. Nature 479, 459 (2011).
6. Kwon, B. K., Hillyer, J. & Tetzlaff, W. J. Neurotrauma 27, 21–33 (2010).
7. Andlin-Sobocki, P., Jonsson, B., Wittchen, H. U. & Olesen, J. Eur. J. Neurol. 12, 1–27 (2005).




ILLUSTRATION BY 0.0.P.S FOR NATURE 


A plan for mental illness 


Thomas R. Insel and Barbara J. Sahakian describe what the next 
decade of mental-health drug development should look like. 


No chronic disease burdens the world
more than mental illness¹. In any
given year, nearly 40% of the population
in 30 European countries is affected.
And yet, the world is experiencing a crisis in 
drug development for mental illness; drug 
companies are withdrawing from the field or 
redirecting their investments². Last November,
an eagerly awaited compound, called
TC-5214, failed to significantly relieve major 
depression in a phase III trial³. The timing
could not have been worse. Recognizing 
that the pipeline was starting to run dry, the 
Royal Society in London recently convened 
an international scientific seminar to find 
solutions to this pharmacological impasse. 
The meeting concluded that a fundamental 
change was needed in nearly every aspect of 
translational research in mental health. 

For better treatments, better science is 
needed. Genetic studies are already identi- 
fying new molecular targets in other fields — 
for example, research in sickle-cell anaemia 
has uncovered a gene that represses fetal hae- 
moglobin production, providing promising 
new drug targets⁴. The molecular mechanisms
of mental disorders are very complex,
but the latest genetic technologies should 
help to identify effective treatment targets, 
and offer alternatives to the monoamines 
that have dominated psychiatric medications
for half a century.

Borrowing from the success of preven- 
tion in other medical areas, ways to iden- 
tify risk factors of mental illness need to be 
developed so that intervention precedes dis- 
ability. Identifying these factors at an early 
stage may mean psychosocial treatments 
(such as family support and cognitive training)
are more effective than medication.
Early detection could emerge from neuro- 
imaging, cognitive assessments, predictive 
biomarkers or rare genetic variants that sig- 
nal risk or a prodromal stage. 

Even therapeutic end points need to be 
refocused. Currently, treatment is not aimed 
at full recovery, and targets late stages of ill- 
ness rather than using pre-emptive therapies 
in early stages when outcomes might be better⁵.
Treatment should also address adherence,
which remains a problem in 20–70%
of patients with psychiatric illness⁶.

One method for improving the effectiveness
of treatment is integrating medical and
psychosocial approaches. In a study of phobias,
which combined the drug D-cycloserine
and cognitive-behavioural therapy, people
given medication during virtual-reality sessions
designed to accustom them to fearful
situations saw symptoms improve faster
than those just given a placebo with virtual-reality
sessions⁷. Other potential techniques
and ‘neurotechnologies’ include video-game
training to help people in the early stages of
schizophrenia to retain function or to help
children with autism to interact socially, for
example by increasing eye contact.

From a research perspective, progress
requires changes not only in what we do but
also how we do it. As industry withdraws
from the field of mental health, we have an
opportunity to rethink how academia and
industry can work together for the public
good. One obvious way to do this is to allow
academic scientists to investigate compounds
that have been abandoned by industry.
But we cannot simply repeat the best
efforts of a company-directed failed clinical
trial without new strategies. We should consider
new approaches to identifying potential
drug targets and their validation — such
as open-access drug development — to
include industry and academia.

We must also reconsider our use of animal
models; too many compounds that
seem to treat a disease in animals end up
having little impact in humans. Although
animal models may be unable to model all
disease symptoms — such as those of schizophrenia —
they can help to understand the
neural mechanisms of specific symptoms,
such as impulsivity. Effective treatment of
impulsivity could have a significant clinical 
effect on neuropsychiatric disorders such 
as attention deficit hyperactivity disorder, 
mania and substance abuse. We need to rec- 
ognize the promise and limitations of animal 
studies, so trials can move quickly into work 
with human cells and volunteers to define 
or eliminate targets for drug development. 

To achieve this vision, we need a new 
cadre of clinical neuroscientists⁸ to focus on
integrating basic and clinical neuroscience, 
translational medicine, and new methodolo- 
gies. Patients are becoming more involved in 
drug development through advocacy groups 
and allowing their information to be held in 
registries, providing research partnerships 
with an opportunity to identify biomarkers 
and specific phenotypes that pinpoint the 
patient subgroups most likely to benefit from 
a treatment⁹,¹⁰.

The increasing public-health needs of those 
with mental illness, including chronic disabil- 
ity, high suicide rates and early death, demand 
that we do better in the next decade.


Thomas Insel is at the National
Institute of Mental Health, National
Institutes of Health, Bethesda, Maryland,
USA. Barbara J. Sahakian is in
the Department of Psychiatry and
MRC/Wellcome Trust Behavioural and
Clinical Neuroscience Institute, University
of Cambridge, UK. The full author list can
be found at the online version of this article
(see go.nature.com/tlihrn).

e-mail: bjs1001@cam.ac.uk 


1. Collins, P. Y. et al. Nature 475, 27–30 (2011).

2. Cressey, D. Nature http://dx.doi.org/10.1038/ 
news.2011.367 (2011). 

3. Ledford, H. Nature 479, 278 (2011). 

4. Sankaran, V. G. et al. Science 322, 1839-1842 
(2008). 

5. Sahakian, B. J., Malloch, G. & Kennard, C. Lancet 
375, 1854-1855 (2010). 

6. Julius, R. J., Novitsky, M. A. Jr & Dubin, W. R. 
J. Psychiatr. Pract. 15, 34-44 (2009). 

7. Ressler, K. J. et al. Arch. Gen. Psychiatry 61, 
1136-1144 (2004). 

8. Lehner, T. & Insel, T. Acad. Psychiatry 34, 87-89 
(2010). 

9. Ledford, H. Nature http://dx.doi.org/10.1038/ 
news.2011.324 (2011). 


10. Saha, K. & Hurlbut, J. B. Nature 478, 312–313 (2011).




The brain is full of feedback loops and neuronal links that combine to give rise to consciousness. 


The mind mapped 


Robert Stickgold revels in a lively account of a quest to
quantify consciousness.


The psychologist Stuart Sutherland
wrote that it is impossible to define
consciousness “except in terms that
are unintelligible without a grasp of what
consciousness means ... Nothing worth
reading has been written about it.” It is arguable
whether Christof Koch’s Consciousness
provides such a definition, but the book is 
definitely worth reading. 


Koch, chief scientific officer at the Allen 
Institute for Brain Science in Seattle, Wash- 
ington, is perhaps best known for his work 
with the late Francis Crick, searching for the 
neurobiological ‘correlates of consciousness’.
Here, he succinctly lays out the story of that 
quest. Focusing on how the brain might produce
the mind, Koch mixes descriptions of
major experiments with self-reflection and
warnings of the inherent danger of the exercise.

From Koch’s collaborations with Crick, whom he
seems to idolize, to his struggles with
religion and free will, this is an engaging
mixture of personal anecdote, scientific
fact and pure speculation. It is often
charming: Chapter 2, for instance, is entitled,
‘In which I write about the wellsprings
of my inner conflict between religion and
reason, why I grew up wanting to be a scientist,
why I wear a lapel pin of Professor
Calculus, and how I acquired a second
mentor late in life’.

For many, the richest parts of the book 
will be Koch’s lucid descriptions of experi- 
ments such as his work with Itzhak Fried, 
a neurosurgeon who implanted electrodes 
into the hippocampi of people with epilepsy. 
In one patient, Fried found a single neuron 
that responded only to the name or pictures 
of Saddam Hussein; in another, he found 
one that responded only to pictures of the 
actress Jennifer Aniston. In a descriptive tour
de force, Koch explains that although Fried 
dubbed these cells concept neurons, we can 
think of them as “the cellular substrate of the 
Platonic Ideal of Jennifer Aniston”. 

Koch discusses the two theories that form 
the cornerstones of his own views of con- 
sciousness. The first, which he proposed 
with Crick, describes the neural correlates 
of consciousness as involving “reverbera- 
tory feedback loops” (which link higher- 
order sensory regions to the prefrontal 
cortex) and corticothalamic loops (which 
link these regions and nuclei in the thalamus).
These networks then marinate in a
bath of neurochemicals released by fibres 
coursing up from a variety of brainstem 
regions. How these connections produce 
consciousness is explained by the second 
theory, the integrated information theory 
of consciousness. 

Here, the story seems to stumble. Con- 
sciousness, Koch argues, is not an emer- 
gent property of the brain. Rather, it is 
inherent in all matter, as “the amount of 
integrated information generated by the 
system in [a given] state above and beyond 
[that] generated by its parts”. If this seems 
unclear, it is. Koch states that calculat- 
ing this quantity is extremely difficult, 
and demands “heuristics, shortcuts, and 
approximations”. The higher the value, he 
says, the more conscious the system. But 
what does it mean to say that one system
has twice as much consciousness as
another?

Consciousness: Confessions of a Romantic Reductionist
CHRISTOF KOCH
MIT Press: 2012. 184 pp. $24.95, £17.95

There can also be too much gee-whiz 
science for my taste. For instance, talking 
about the nervous system of the worm 
Caenorhabditis elegans with its 302 neu- 
rons, Koch describes the creature's “state 
of consciousness” as the current posi- 
tion of the system on a 302-dimensional 
graph, with 2^302 possible states (for each
combination of the 302 neurons either 
firing or not). The neural network is a 
material thing, Koch argues, but the shape 
of the graph describes the conscious 
experience of the worm at that moment, 
“its phenomenal experience”. 

By this argument, the 32,768-byte 
computer file holding this review has a 
consciousness defined by its position 
on a 32,768-dimensional graph with 
256^32,768 (or, put more quaintly, 1 followed
by 87,000 zeros) possible states. Person- 
ally, I don't think any of them have much 
chance of being conscious. 
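
The state counts in the two preceding paragraphs are plain exponent arithmetic, and a short sketch makes them easy to check. The Python below is an illustration only: the function names are hypothetical, and it assumes, as the text does, that every unit contributes the same number of independent levels (2 per neuron, 256 per byte). It works with base-10 logarithms so that the huge counts never have to be written out in full.

```python
import math


def log10_state_count(units: int, levels_per_unit: int) -> float:
    """Base-10 logarithm of levels_per_unit ** units."""
    return units * math.log10(levels_per_unit)


def decimal_digits(units: int, levels_per_unit: int) -> int:
    """Number of decimal digits needed to write levels_per_unit ** units."""
    return math.floor(log10_state_count(units, levels_per_unit)) + 1


# 302 neurons, each either firing or not.
worm_digits = decimal_digits(302, 2)

# 32,768 bytes, each taking one of 256 values.
file_digits = decimal_digits(32_768, 256)
```

Under these assumptions the worm's 2^302 states amount to a 91-digit number, and the file's 256^32,768 states to a number of 78,914 digits, a little short of the rounded figure quoted in the review.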

There are other points on which Con- 
sciousness lacks clarity. Koch tackles 
most of the big questions: are mice and 
fruitflies conscious (yes); can comput- 
ers be conscious (yes); are they already 
(yes); is there free will (not really); can 
we ever understand consciousness (yes); 
can the mind affect our behaviour (no); 
and even is there a god (yes, but absent
since the Big Bang).


Unfortunately, it is often hard to decipher
which of these he thinks are proven facts,
which personal beliefs and which momentary [...]

In fairness, the equations are much
more complicated than is described here.
Still, in the end, the theory seems to me 
to be an example of a conflation of ignorances:
saying that because we don't really
understand A or B, they must be causally 
related. Whether they invoke quantum 
mechanics, microtubules or integrated 
information content, these explanations 
of consciousness are, for all their math- 
ematical precision, no more than pure 
speculation. Perhaps one of them is actu- 
ally right. But perhaps, like ancient Greek 
philosophers searching for the cause of 
thunder, we are still thousands of years 
from an answer. 

I argued with Koch all the way through
this book. And I loved every minute of it.


Robert Stickgold is an associate 
professor of psychiatry at Harvard 
Medical School in Boston, Massachusetts. 
e-mail: rstickgold@hms.harvard.edu

Humans have been probing the mysteries of the Antarctic for almost 200 years.


POLAR RESEARCH 


Deep-frozen science 


Francis Halzen is exhilarated by a trek through stories 
of research and exploration in Antarctica. 


A dazzling array of narratives throngs
Antarctica, collected from scientists
in one of Earth's most extreme
environments. Science writer and consultant 
Gabrielle Walker gathered these stories in the 
course of five trips criss-crossing the conti- 
nent, mostly as a guest of the US National 
Science Foundation’s Antarctic programme. 
This is not just a highly accessible ency- 
clopedia of Antarctic science. It interlaces 
researchers’ stories with natural history, tales
of the ‘heroic age’ of exploration and pas- 
sages that viscerally describe the cold, isola- 
tion and beauty of the environment. Neatly 
organized geographically, the book covers 
the East Antarctic coast, with McMurdo Sta- 
tion (the largest community in Antarctica) 
and the penguins; the high plateau, with the 
Concordia and South Pole research stations; 
and the isolated West Antarctic coast. 
Walker talks about the Dry Valleys near 
McMurdo, which run from the edge of the ice 
sheet that covers most of the continent down 
to the Ross Sea. Precious little precipitation 
has fallen here for millions of years: with an 
average temperature of −55 °C, this is Mars
on Earth. But it is teeming with life — cyano- 
bacteria, found in ponds the world over. 
Here, they live inside the dry rock, surfacing 
for only a few weeks a year to find water from 
the little snow that fell over winter. Then they
go back to sleep.

Walker learned that fascinating story on
a diving expedition in Lake Hoare in the
Dry Valleys with Peter Doran, a biologist at
the University of Illinois at Chicago. But
it isn’t only researchers who feature here.
The book is peppered with characters who
“keep the scientists alive”. On one trip we
meet camp manager Rae Spain, who came
to Antarctica as a carpenter and returned
because she could not get the continent out
of her head. “It haunts you,” she told Walker.

Antarctica: An Intimate Portrait of the World’s Most Mysterious Continent
GABRIELLE WALKER
Bloomsbury: 2012. 416 pp. £20

NATURE.COM
To read about Antarctic research 100 years ago:
go.nature.com/9fhuuj

Mars has literally come to Earth just 
beyond the Dry Valleys, Walker tells us. 
Researchers on Skidoos systematically scan 
the ice sheet for meteorites — a relatively easy 
task, given that everything stands out in this 
icy landscape. We accompany John Schutt, a 
mountaineer who has returned for the hunt 
every year since 1980. More meteorites have 
been found since the 1970s in Antarctica 
than over centuries in the rest of the world. 

In 1982, for the first time, an Antarctic 
meteorite was identified as coming from the 
Moon. Two years later, researchers found a 
rock from Mars that turned out to contain 
structures that may be nanoscale fossils: the 
most intriguing indications yet that life may 
have existed on other planets in the Solar Sys- 
tem, although debate is still raging. 

The West Antarctic coast, as Walker 
shows, is out of reach of the permanent
scientific stations, and difficult to approach
by vessel. This is where gigantic glaciers 
empty into the Pacific Ocean, and studies 
have revealed rapid changes in the ice flow. 
Helen Fricker, a glaciologist at the Scripps 
Institution of Oceanography in La Jolla, 
California, has traced these changes to sub- 
glacial lakes linked by canals, which form a 
dynamic hydrologic system on which the ice 
slides to the sea. Walker journeys across the 
inhospitable glaciers, where she stands on ice 
flows the size of Niagara Falls with George 
Denton of the University of Maine in Orono, 
a veteran in ‘reading’ glacial landscapes. 

Some of the historical vignettes will be 
familiar, such as the race to the South Pole 
between Norwegian explorer Roald Amund- 
sen and his trained team, and Robert Scott, 
British hero of scientists, who presumably 
found Amundsen’s efficient approach ungentlemanly.
Other stories are less well known,
such as that of Australian geologist Douglas 
Mawson, who travelled to the south magnetic 
pole as part of Ernest Shackleton’s Nimrod 
expedition and later led a disastrous research 
trip to Adélie Land, on which two men died.

Tales of those expeditions, and the scien- 
tific ones that followed, remind us that people 
have been travelling to Antarctica for almost 
two centuries, ever since seal hunter Captain 
John Davis first set foot there in 1821. The 
Antarctic Treaty, signed by 49 countries, has 
since 1961 guaranteed that the continent 
is reserved for science. High technology 
has now arrived: the collaborations behind 
the South Pole Telescope and the IceCube 
Neutrino Observatory have constructed 
the kinds of instrument that are more rou- 
tinely built at laboratories such as Fermilab 
near Batavia, Illinois, or CERN near Geneva, 
Switzerland. IceCube collects a few hundred 
neutrinos per day, some with energies that 
exceed those at earthbound accelerators by 
more than two orders of magnitude. 

Yet Antarctica is still the “world’s most
mysterious continent”, as it remains the
only one on which humans have never lived 
permanently. Walker captures that mystique 
through interviews with people who have 
made Antarctica part of their lives. Perhaps 
the most notable among them are the “telescope
nannies”, who return each year in early
February to spend the long Antarctic winters 
at the South Pole, taking care of the scientific 
equipment and data acquisition after scien- 
tists have boarded the last planes back to 
their universities. Their arduous job brings 
a rare reward: a sense of the untrammelled 
isolation of this vast continent.


Francis Halzen is the principal investigator 
of the IceCube project, and Hilldale and 
Gregory Breit Professor in the department 
of physics at the University of Wisconsin- 
Madison. 

e-mail: halzen@icecube.wisc.edu


Books in brief 


Free Will 

Sam Harris FREE PRESS 96 pp. $9.99 (2012) 

Neuroscientist Sam Harris, the author of the bestselling The Moral 
Landscape (2010), here skewers the concept of free will — that 
mainstay of law, policy and politics — in fewer than 100 pages. 
Using evidence drawn from psychology and neuroscience, Harris 
asserts that the course of human life is all down to luck, and that 
willpower is a “biological phenomenon”. We are not in charge
of our own minds, he says: thought and intention simply arise.
This is a tract that is sure to boldly go straight to the heart of the 
determinism debate. 


Why Calories Count: From Science to Politics

Marion Nestle and Malden Nesheim UNIVERSITY OF CALIFORNIA PRESS 304 pp. $19.95 (2012)
Obesity has gone global, as has misinformation about nutrition
and food. Nutrition scientists Marion Nestle and Malden Nesheim
unscramble the confusion with a serving of science. They reveal
how calories (those potent but ill-understood measures of heat
energy) are really counted, why we need them, how we use them,
how many we actually need and why it all sometimes goes so wrong.
From ‘secret’ calories to food politics, malnourishment and calorie
restriction for health, this is a feast for the mind.


Born Believers: The Science of Children’s Religious Belief 

Justin L. Barrett FREE PRESS 320 pp. $26.00 (2012) 

Psychologist Justin Barrett says that belief is rooted not in
indoctrination, but in a default setting of the infant brain. A range
of findings in developmental psychology, he argues, support the
theory: very young children, for instance, are aware that nature is 
not designed by humans; and people tend to look for unseen agents 
behind natural phenomena. Barrett says that the evidence points to 
a universal “natural religion” — but he also argues that parents must 
not impose belief systems on their children, and should leave the 
question of whether to die a believer up to their offspring. 


Tinderbox: How the West Sparked the AIDS Epidemic and How the 
World Can Finally Overcome It 

Daniel Halperin and Craig Timberg PENGUIN 432 pp. $29.95 (2012) 
The butchering of a chimpanzee in Cameroon about a century ago 
may have been when simian immunodeficiency virus crossed into 
humans and became HIV. But, argue epidemiologist Daniel Halperin 
and journalist Craig Timberg, European greed and technologies 
sparked the HIV pandemic. Trade routes, the growth of colonial cities, 
a decrease in male circumcision and a rise in prostitution effectively 
incubated and spread the disease — and Western programmes to 
curb it in Africa have proved mostly ineffectual, say the authors. 


Digitized: The Science of Computers and How it Shapes Our World
Peter J. Bentley OXFORD UNIVERSITY PRESS 256 pp. £16.99 (2012)
The wiring of the world, from a glint in Alan Turing’s eye to global
domination, is neatly encapsulated by computer scientist Peter
Bentley. An enthusiastic guide, he lays this compelling history bare,
explaining the science as he goes: Moore’s law of processing-power
growth, the circuit designs of Claude Shannon, John von Neumann’s
early electronic digital computer, neural networks and the staggering
array of applications for computing in everything from pizza delivery
to Alzheimer’s-disease research.

Curved-crease origami
sculptures can self-fold 
into intricate patterns. 


Q&A Erik Demaine 


The origami geometer 


Computer scientist Erik Demaine uses origami to advance computational geometry and create 
art. His paper sculptures, made with his father, artist Martin Demaine, are now on show at the 
Japanese American National Museum in Los Angeles, California; from August, the exhibition 
will tour the United States. He explains the challenges of folding together mathematics and art. 


How did you come to do most of your work with your father?
When I was five, I helped him to design wire puzzles for toy
shops across Canada. He educated me himself as we travelled
around the United States. When I went to university at the
age of 12, he attended all my classes, mostly computer science
and mathematics. Eventually, we started
working together on mathematics and art.
He is now an artist in residence at the Massachusetts
Institute of Technology (MIT) in
Cambridge, where I am a computer scientist.

Folding Paper: The Infinite Possibilities of Origami
Japanese American National Museum, Los Angeles, California.
March-August 2012.


How did you discover origami? 

I heard about Robert Lang, a great origami 
designer and physicist. Computational ori- 
gami was just starting, and my father and I 
became pioneers. Our first project was the 
fold-and-cut problem: how many shapes 
can be made by folding a sheet of paper,
making one straight cut, then unfolding it?
Houdini made a five-pointed star in this way 
in the 1920s. It took us years to prove that you 
can make any straight-edged shape; we have 
designed swans, butterflies and the MIT logo. 
Another result was an algorithm for folding 
any three-dimensional (3D) shape out of one 
sheet of paper. 


What is ‘self-folding’ origami? 

It is a form in which the force of the creases 
pulls the material naturally into shape, with- 
out any manipulation. We're trying to find an 
algorithm to make any 3D shape with this 
method. One application would be a space 
station that assembled itself in space. We 
have devised a grid-based crease pattern that 
would allow a microscopic sheet to self-fold 
into any shape, in theory, by making cubes 
that stack together like 3D pixels. We haven't 
yet achieved self-folding at the nanoscale. 


What kind of origami are you doing now? 
We have been working with curved creases,
the properties of which are poorly understood.
We were trying out some curved patterns, and saw that
they looked beautiful. We're still pursuing
unsolved mathematical problems too. David
Huffman, who invented the compression
algorithms used for mobile phones, left many
beautiful origami sculptures when he died in
1999. He was getting to the point at which he
could conceive a 3D form, then reverse engineer
the steps to fold it with curved creases.
We hope to have him as a posthumous co-author
on some papers.

NATURE.COM
For more on maths and origami:
go.nature.com/hittdy


How do you actually fold the paper? 

For prototype models we use a robotically 
controlled laser to score the paper, but we 
prefer a simple compass-like device. The 
final folding we always do by hand. At the 
first show we contributed to, we made three 
sculptures by folding two circular pieces of 
paper together using alternating pleats in 
concentric circles. Curved-crease sculptures 
go back to a late-1920s design by the Bauhaus 
school in Germany. The surprising thing is 
how they self-fold into ornate forms from 
these simple creases. For the Folding Paper 
exhibition, spanning the range of origami 
art, we gave ourselves a little more power by 
manually joining a few key points together. 


How did you come to study balloon animals? 
That was a collaboration with my father and 
Vi Hart, a composer whose own father is a 
mathematical sculptor. We noticed that bal- 
loon animals could be seen as outlining the 
edges of a flat-sided 3D solid, and we found an 
algorithm that tells you how many balloons 
you need to build a given solid. One appli- 
cation, conceived after the fact, would be to 
build a pavilion from a single tube. But our 
motivation was just to play with balloons.

Ads against chimp
research criticized 


We at the Federation of American 
Societies for Experimental 
Biology object to your publication 
of advertisements from the 
Humane Society of the United 
States that seem to misrepresent 
the importance of chimpanzees 
in biomedical research and testing 
(see, for example, facing page 407 
in 22/29 December 2011 issue of 
Nature). 

These advertisements 
claim that chimpanzees are a 
poor model for research into 
human disease because of their 
physiological and immunological 
differences. This contention is 
at odds with the findings of a US 
Institute of Medicine committee 
appointed to investigate whether 
chimpanzees are necessary for 
biomedical and behavioural 
research (see go.nature.com/ruthsl).

The committee, composed 
of scientific experts and led by 
a bioethicist, concluded that 
chimpanzees have been valuable 
models in the past and that, 
although “most current use of 
chimpanzees for biomedical 
research is unnecessary”, they are
still needed to conclude research 
on monoclonal antibodies and 
possibly in the development of a
prophylactic hepatitis C vaccine. 

The committee emphasized 
that an outright ban on 
biomedical chimpanzee research 
would not be appropriate, in 
part because of new, emerging or 
re-emerging diseases that cannot 
be studied in non-chimpanzee 
models. 
Joseph C. LaManna FASEB, 
Bethesda, Maryland, USA. 
Joseph-LaManna@faseb.org 


Turing: Colossus 
computer revisited 


George Dyson's history of the 
Colossus computer (Nature 482, 
459-460; 2012) is somewhat 
misleading. The development 
of Colossus owed little to the 
Bletchley Park ‘bombe’ devices. 


After Bill Tutte developed 
a statistical strategy to tackle 
the German Fish cypher, Max 
Newman proposed a machine 
to implement it, using two 
paper tapes. With help from the 
Telecommunications Research 
Establishment (later the Royal 
Radar Establishment) and from 
Frank Morrell’s group at the Post 
Office Research Station at Dollis 
Hill, the Robinson machines 
were built. These were slow but 
validated the technique. Thomas 
Flowers’ group recommended 
the use of many more vacuum 
tubes to improve performance, 
but they were wrongly 
considered too unreliable by 
Bletchley Park. 

So Flowers built the first 
Colossus at Dollis Hill instead of 
Bletchley Park although, in time, 
Newman spoke to him about his 
requirements. 

One crucial technique 
developed by Turing and used 
by Tutte was that of ‘delta- 
ing’ or using the differences 
between characters rather than 
the characters themselves. If 
Dyson had pointed out that 
Tutte used this method to detect 
the statistical non-randomness 
of plain text, it would also have 
helped in understanding how 
different Fish was from Enigma. 
Henry Shipley Hexham, UK. 
h.h.shipley.61@cantab.net
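
The ‘delta-ing’ idea described in the letter above can be sketched in a few lines. The Python below is a minimal illustration, not a reconstruction of the wartime procedures: it assumes that characters are 5-bit teleprinter codes held as small integers and that the ‘difference’ between neighbouring characters is their bitwise XOR. The function name and the sample values are hypothetical.

```python
from typing import List


def delta(stream: List[int]) -> List[int]:
    """XOR each character with its successor; the result is one item shorter."""
    return [a ^ b for a, b in zip(stream, stream[1:])]


# Made-up 5-bit values standing in for plain text and key.
plain = [0b00001, 0b00001, 0b00011, 0b00111]
key = [0b10101, 0b01010, 0b11111, 0b00000]

# Encipherment modelled as character-by-character XOR (an assumption).
cipher = [p ^ k for p, k in zip(plain, key)]

delta_plain = delta(plain)
delta_cipher = delta(cipher)

# Differencing commutes with XOR encipherment:
# delta(cipher) equals delta(plain) XOR delta(key), item by item.
recombined = [dp ^ dk for dp, dk in zip(delta_plain, delta(key))]
```

A repeated character in the input shows up as a zero in the differenced stream, which is the kind of regularity that a statistical count can pick up even when the characters themselves look random.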


Turing: Brain model 
still incomplete 


In contemplating whether
the brain is a good model for
machine intelligence (Nature
482, 462-463; 2012), I believe
that Alan Turing’s principle that
the brain performs computations
will continue to hold true. But
it seems clear from the state
of machine intelligence today
that we're missing some basic
insight into the language of brain
computation.

Advances in genetics, 
electronics and optics are now 
enabling us to look into simple 
brains while they’re living 
and behaving, and to observe
the activity of every neuron
in real time. This influx of
information should eventually 
help us to make intelligence 
comprehensible and replicable. 

A good approach to 
interpreting these data is the 
re-creation of neural processing 
in simulation, from sensation all 
the way to behaviour. But once 
we understand what's going on, 
it should be possible to create 
machine intelligence that doesn’t 
rely on reproducing low-level 
chemical dynamics. 

Over the coming decades we 
are likely to take high-level cues 
from biology on how to organize 
our silicon. Mind is merely a 
function of the brain, however, 
so we should be able to capture 
that function in any Turing- 
complete system once we know 
how the brain is organized to 
perform it. 

David Dalrymple Cambridge, 
Massachusetts, USA. 
davidad@alum.mit.edu 


Turing: Beyond the 
original concept 


Barry Cooper asks whether 
information can be increased 
through computation, pointing 
out that Turing computation does 
not create anything not already 
in the initial data (Nature 482, 
465; 2012). If we are limited to 
Turing machines, then I believe 
the answer to his question is no. 
But if we enhance them then 
information can accumulate. 

Turing machines are designed 
to model functions, not ongoing 
computations involving 
additional input over time. But if 
we enhance Turing machines by 
giving them a persistent memory 
and allowing them to alter their 
input by interacting with their 
environment, then information 
can increase. 

Interactive machines engage 
in input and output during 
computation, which is closer 
to how computers are used 
in practice than in the Turing 
machine framework: examples 
include operating systems,
interactive agents in artificial
intelligence, and solutions to 
some problems in control theory. 
This observation is largely 
unappreciated in theoretical 
computer science, but a handful 
of researchers are exploring this 
part of ‘super-Turing’ space. 
Christopher Kanan University 
of California San Diego, La Jolla, 
California, USA. 
ckanan@cs.ucsd.edu 


Be sparing with 
international laws 


We believe that Devi Sridhar’s
justification for proposing
a Framework Convention
on Alcohol Control is
problematic (Nature 482,
302; 2012). Standard-setting
international laws are largely 
dictated by powerful states, 
based on expectations that 
they themselves already meet, 
obliging poorer states to 
implement these ‘enlightened’ 
global policies ahead of local 
priorities. Litigation by foreign 
non-governmental organizations 
(NGOs) can also get in the way 
of national policy-making, 
particularly as most NGOs are 
led from the West. 

Sridhar’s proposal joins 
growing calls for new 
international health laws. But 
first we need more evidence 
that international laws achieve 
results commensurate with the 
cost of drafting, ratifying and 
implementing them. 

In our view, some clear criteria 
need to be fulfilled before the 
WHO invokes its law-making 
authority. These could be set by 
a commission on global health 
law. Ill-justified international 
health laws that dictate poor 
countries’ policies and priorities 
from afar could prevent serious 
consideration of initiatives better 
suited to legal instruments. 
Steven J. Hoffman McMaster 
University, Hamilton, Canada. 
hoffmans@mcmaster.ca
John-Arne Røttingen Harvard
Kennedy School, Cambridge,
Massachusetts, USA.

Catalysis in tight spaces


An acid has been found to catalyse the formation of a common chemical group, the spiroacetals, and to control which 
mirror-image isomer of the group is made. The key to success is the acid’s bulky molecular structure. SEE LETTER P.315 


NOAH Z. BURNS & ERIC N. JACOBSEN 


Opposites attract. When applied to the
charges in organic molecules, this idea
serves as a foundation for understanding
how such molecules react. A large portion
of organic chemistry can be explained by the
concept that electron-rich molecules (or sites
on molecules), which have negative character,
seek to interact with, and potentially donate
electrons to, electron-poor species that have
positive character. On page 315 of this issue,
Čorić and List¹ describe a fresh use of this
concept, with a report of a negatively charged,
bulky catalyst that controls the reaction of a
positively charged intermediate in the synthesis
of potentially useful compounds known as
spiroacetals.

Some of the most reactive, and therefore the
most synthetically useful, species for organic
transformations are those bearing a full negative
or positive charge. Of these, oxocarbenium
ions, oxygen-containing cations that form
as reaction intermediates (Fig. 1a), are particularly
useful for making a variety of chemical
linkages and groups. For example, they are
used to make acetal groups, in which a carbon
atom is connected to two oxygen atoms
(Fig. 1b). Acetals are key linkages in numerous
biomolecules, notably serving to bond sugar
molecules together in long polymeric chains
such as those found in amylose and amylopectin
(the two components of starch), as well
as in cellulose.

Figure 1 | Acetals and an acetal precursor.
a, Oxocarbenium ions are reactive intermediates
that can be used to make various chemical groups,
including b, acetals and c, spiroacetals. The rings
depicted in the spiroacetal structure represent
rings of any size. d, One of the simplest spiroacetals
is olean, the pheromone of the olive fruitfly.

Figure 2 | A spiroacetal-forming reaction. a, Čorić and List¹ report a method for preparing chiral
spiroacetals from a planar oxocarbenium ion. The ion is generated by treating its precursor with a
Brønsted-acid catalyst, H–X. If the acid is not chiral, the alcohol group (OH) will attack the planar
oxocarbenium ion from either face with equal likelihood, leading to a racemic (50:50) mixture of
mirror-image isomers of a spiroacetal product (in this case, olean). b, With the chiral Brønsted acid
developed by Čorić and List, the chiral anion X*⁻ generated after protonation (the asterisk denotes
that the anion is chiral) can selectively block one face of the oxocarbenium ion, generating one
mirror-image isomer selectively. The mirror-image isomer of the catalyst that is chosen determines
which isomer of the product is produced. The anion formed from the acid catalyst is shown; the broken
blue line indicates that the negative charge is delocalized across the atoms shown in blue.

The precise stereochemistry (the three- 
dimensional spatial arrangement) of acetals 
has profound implications. For example, both 
amylose and cellulose are made from the same 
glucose monomer, but have markedly different 
physical, chemical and biological properties. 
All that differentiates these two compounds 
is the stereochemistry at the acetal linker. 
Therefore, finding ways to control acetal
stereochemistry, which is precisely what Čorić and
List report¹ in their reaction for constructing
spiroacetals (a subclass of acetals formed when
the acetal carbon joins two rings; Fig. 1c), is
a highly relevant challenge. The reaction that
the authors report controls the chirality, or
‘handedness’, of the products that form.

Spiroacetals are found in a wide variety of 
naturally occurring, biologically active small 
molecules. For example, integramycin, a
potent inhibitor of the integrase enzyme² that
allows HIV-1 to integrate its genetic material
into the DNA of a host cell, contains a
spiroacetal. Olean, the pheromone of the olive
fruitfly, is one of the simplest spiroacetals 
(Fig. 1d), and provides another illustration 
of the importance of stereochemistry to a 
molecule’s properties: the two mirror-image 
forms of olean selectively attract flies of a
different sex³.

Čorić and List¹ constructed the spiroacetals
from a planar oxocarbenium ion that
contained a pendant alcohol group (Fig. 2a). 
The stereochemical challenge was to control 
the addition of the alcohol group to just one 
face of the planar ion. One solution would be 
to use stereochemistry already present else- 
where in the molecule to direct the selectivity 
of the addition. But such internal control is, 
by definition, substrate-dependent and rarely 
allows both possible stereochemical out- 
comes of the reaction to be made selectively. 
A more recent and potentially more general 
approach is to control the stereochemistry 
using an external, negatively charged catalyst 
that interacts directly with the oxocarbenium 
ion through Coulombic attraction⁴,⁵. Ideally,
the stereochemistry of the catalyst alone can
determine which of the two isomers of acetal 
is generated. 

The problem with this seemingly straight- 
forward strategy is that a very specific spatial 
relationship between the reacting site and the 
chiral catalyst is required to obtain high stereo- 
selectivity. Electrostatic attraction alone is not 
sufficient to impose the required level of order, 
because it is direction-independent. This prob- 
lem could be overcome by incorporating other 
features into the catalyst, to allow it to orient 
the charged substrate through attractive, non- 
covalent interactions — a principle that is well 
established in enzymatic catalysis, and which 
has also been established in small-molecule 
catalysis⁶. An alternative approach is to create
a rigid, highly restricted environment near the 
charged area on the catalyst that might allow 
the cationic substrate to interact with the cata- 
lyst only in a highly specific manner. 

Čorić and List¹ adopted the second of
these strategies in their work. They used
an exceptionally bulky Brønsted acid (a
compound that can donate protons, H⁺, to
other molecules) as a catalyst⁷ to construct
a variety of small spiroacetals in a highly
stereoselective manner. In the authors’ system,
the catalyst donates a proton to the substrate,
which becomes an oxocarbenium ion.
The deprotonated Brønsted acid (Fig. 2b)
then acts as a counteranion for the oxocarbenium
ion. A complementary approach⁴,⁵
for generating oxocarbenium ions is to use a 
catalyst that removes an anion from a suit- 
ably reactive substrate; by forming a nega- 
tively charged complex with the anion, the 
catalyst controls subsequent additions to the 
oxocarbenium ion. 

A notable aspect of Čorić and List’s work
is that simple substrates can be used that lack
the large or polar groups often required in
Brønsted-acid-catalysed reactions to ensure
effective catalyst–substrate interactions. The
authors propose¹ that this is because they used
a particularly large Brønsted-acid catalyst, so
the bulky anion that forms from the catalyst 
after the proton-transfer step can interact in 
a specific enough manner with the oxocar- 
benium ion to preferentially block one of the 
cation’s faces. This prevents the alcohol from 
adding to that face and thereby directs addition 
selectively to the other face. Addition to either 
face can be selected for in a reaction simply by 
choosing the appropriate mirror-image form 
of the catalyst. 

At present, Čorić and List’s method seems 
to be limited to reactions that generate five- or 
six-membered spiroacetal rings. One remain- 
ing question is whether the success of the cata- 
lyst with simple substrates may be extended 
to more complex molecules. For example, 
the authors report’ that their Brønsted acid 
is unable to control and override the inherent 
selectivity of spiroacetal formation in a large 
steroidal molecule that contains many other 
chemical groups and rings of atoms. Many 
biologically active spiroacetals are similarly 
complex, and so a method for synthesizing 
them would be highly desirable. 

A mechanistic investigation of Čorić and 
List’s reactions will be needed to establish 
whether the proposed confined-space effect 
is in fact operative, and to discover precisely 
how the cation–anion pair formed during the 
reactions yields spiroacetals stereoselectively. 
But such effort seems worthwhile, given the 
potential of bulky Brønsted acids as catalysts 
for other valuable organic transformations. 


Noah Z. Burns and Eric N. Jacobsen are in 
Inside story of 
ferroelectric memories 


The drive to improve digital memory through ever-shrinking electronic 
circuitry will ultimately face a bottleneck. Researchers propose exploiting 
the room ‘inside’ memory elements as a solution. 


VINCENT GARCIA & MANUEL BIBES 


The amount of stored information in 
the world is reaching an astronomical¹ 
5 × 10²¹ bits, roughly one million 
times the number of cereal grains produced 
on Earth in a year’. And the number of bits 
doubles every three years’. This has moti- 
vated efforts to produce improved digital 
memory solutions that have better reliability, 
lower power consumption, lower cost and, 




crucially, higher storage density. Writing in 
Advanced Materials, Lee et al.’ propose a 
method to increase the storage density of an 
existing class of commercial memory. The 
technique involves controlling the physical 
process through which information is writ- 
ten in memory cells — the building blocks of 
data-storage devices — instead of the usual 
approach of reducing the cells’ size. 

The data-storage market is dominated by 
hard-disk drives (HDDs) and Flash memories. 
Figure 1 | Binary and multilevel data storage. a, In a classical binary ferroelectric random access 
memory (FERAM), a bit of information, with a logic value of ‘1’ or ‘0’, is encoded in the up (red block) 
or down (blue block) polarization direction of the device's constituent cells. In this 36-bit example, the 
bits are stored in 6 × 6 cells, each of which stores one bit. Electrical wires (grey) are used to apply voltages 
across the cells to write and read the cells’ polarization. b, Lee et al. describe a multilevel FERAM that can 
store three bits in a single cell. In this way, three-bit logic states (000, 001, 010, 011, 100, 101, 110 and 111) 
can be stored in each of 3 × 4 cells of a smaller 36-bit memory array. Each state corresponds to a specific 
configuration of domains of opposite polarization in the cells. 
A brainy background 


The vertebrate brain is so complex that 
tracing its origins among invertebrates is 
problematic. It must have evolutionary 
roots somewhere, but where? On page 289 
of this issue, Pani et al. point the way with 
their finding that acorn worms have 
gene-expression signatures very similar 
to those that direct vertebrate brain 
development (A. M. Pani et al. Nature 483, 
289–294; 2012). 

The closest relatives of vertebrates are the 
urochordates (tunicates, or sea squirts) and 
cephalochordates (the superficially fish-like 
lancelet). More distant on the evolutionary 
tree lie the hemichordates, which include 
acorn worms such as Saccoglossus 
kowalevskii (pictured), and the echinoderms 
(including starfish and sea urchins). Together, 
these groups comprise the deuterostomes, 
among which the elaborate structural and 
genomic innovations of vertebrates stand 
out rather like a cuckoo chick in a nest of 
willow warblers. 

One such feature is the signalling 


HDDs provide the highest-capacity solu- 
tions, with an unbeatable cost per bit. But they 
are slow and prone to failure when shocked, 
causing the loss of documents, data or holi- 
day pictures. Solid-state memories such as 
Flash, which are ubiquitous in memory sticks 
and memory cards for cameras, are faster and 
more robust than HDDs. However, they can- 
not match the cost per bit and maximum stor- 
age capacity of HDDs. But they are well suited 
to mobile electronic devices, and the consensus 
is that most future data-storage solutions will 
be based on solid-state memories. 

The question is how to increase storage den- 
sity and reduce the cost of solid-state memo- 
ries while maintaining their good performance 
(fast data-access and write times, low power 
consumption and high endurance). Follow- 
ing the advice of physicist Richard Feynman, 
researchers have for decades exploited the 
“room at the bottom” — that is, they have tried 
to reduce the size of memory cells. Impressive 
progress in nanometre-scale lithography has 
allowed a marked reduction in the size of a 
memory cell. So today, the smallest commer- 
cial Flash memory cell* has a lateral size of 
about 80 nanometres. 

To further reduce the size of memory cells, 
problems related to charge leakage, which 
causes undesired data erasure, and general 
reliability issues must be solved. This is prov- 
ing more challenging than ever as cells reduce 
in size. One solution is to increase storage 
density not by using fine-pitch lithography 
but by piling up two-dimensional arrays of 


centres — physical regions of a developing 
embryo in which a specific set of secreted 
proteins drives the formation of a particular 
body section. These centres are present in 
the vertebrate neuroectoderm, an embryonic 
tissue that gives rise to the brain and nervous 
system. They are modified or completely 
absent in tunicates and lancelets, suggesting 
that many aspects of brain development are 
unique to vertebrates and arose de novo in 
the vertebrate lineage. 

Pani and colleagues’ data contradict this 
theory. They show that a gene-expression 
program similar to that in three vertebrate 
neuroectodermal signalling centres is also 
present in S. kowalevskii. Although acorn 
worms do not have anything like a brain, 
Pani et al. show that these genetic programs 
are used to pattern (direct the development 
of) the ectoderm of the embryonic animal. 
The genes expressed are equivalent to those 
found in vertebrates, and they are deployed 
at similar times and places in vertebrate and 
acorn-worm embryos. 


memory cells and creating three-dimensional 
structures, thereby exploiting the ‘room at 
the top’. The practical implementation is, 
however, extremely difficult in terms of 
circuitry, which prevents the pile-up of more 
than a few layers. 

In their study, Lee et al.’ propose another 
method of data storage that exploits ‘the room 
inside’ the memory cell itself to store more 
than just one bit of information. Rather than 
Flash, the authors address ferroelectric ran- 
dom access memories (FERAMs), which, 
along with phase-change and magnetic mem- 
ories, are an alternative type of non-volatile 
random-access-memory technology’. In a 
FERAM cell, a bit of information (a ‘1’ or ‘0’ 
logic state) is usually stored in the direction 
(up or down) of the polarization of a ferroelec- 
tric material, one that can retain a permanent 
electric dipole in the absence of an electric field 
(Fig. 1a). The polarization of a ferroelectric is 
the sum of the electric dipoles present in each 
unit cell of the material. Its direction can be 
switched by applying a voltage, just as the mag- 
netization in a ferromagnet (a material that 
maintains a magnetic dipole in the absence 
of a magnetic field) can be reversed with a 
magnetic field. 

Rather than just relying on the direction 
of the polarization to store information in a 
FERAM cell, Lee and colleagues make the 
best of the physical mechanism at the heart of 
a memory operation by taking advantage of 
the structure of domains of different polariza- 
tion orientation in the ferroelectric material. 
The researchers propose that these 
signalling centres were part of an ancient 
gene-regulation ‘scaffold’ that was present in 
the common ancestor of all deuterostomes, 
and that was retained in hemichordates 
and vertebrates but lost in the evolutionary 
branches that formed lancelets and 
urochordates. Such an idea implies that 
the common ancestor was unexpectedly 
complicated. This does not mean that it 
had a complex brain, but that the genetic 
programs that were eventually modified 
for use in patterning the vertebrate brain 
already existed in an ancestral creature that 
lived perhaps more than 600 million years 
ago. Henry Gee 


When a voltage is applied to reverse a ferro- 
electric’s polarization, it usually does not 
reverse all of the material’s constituent electric 
dipoles homogeneously. Instead, small regions 
(the domains), each with their own polariza- 
tion, nucleate and then progressively expand. 
Depending on the voltage value, or the time 
during which it is applied, configurations — in 
which up and down domains coexist — can 
be stabilized. The net polarization is the sum 
of the up and down domains, and thus inter- 
mediate states of net polarization may be used 
to store information. These intermediate states 
define a multilevel memory element. 
However, owing to the stochastic nature of 
ferroelectric polarization switching, control- 
ling the up- and down-domain fractions by 
this voltage-based procedure creates poorly 
defined final intermediate polarization states. 
Lee et al.’ show that this poor definition can 
be greatly diminished by limiting the electri- 
cal current that appears in the ferroelectric 
when its polarization starts to be reversed by 
the applied voltage. Thus, complete polariza- 
tion reversal is hampered, and the ferroelectric 
is left in a multidomain configuration. In this 
way, the authors were able to write eight well- 
defined logic states (000, 001, 010, 011, 100, 
101, 110 and 111; Fig. 1b), converting a one-bit 
conventional FERAM memory element into a 
three-bit one. Although the main results were 
obtained on single-crystalline ferroelectric 
materials, they also show that the approach is 
applicable to polycrystalline films, which are 
typically used in commercial FERAMs. 
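
The arithmetic behind this gain in density can be sketched in a few lines of Python. The snippet is only an illustration: the function names are hypothetical, and the mapping from a state index to a particular domain configuration is left abstract, because it depends on the device. It shows that a cell holding n bits must support 2^n distinguishable polarization states, and that a 36-bit array shrinks from 36 cells to 12 when each cell holds three bits.

```python
import math


def cells_needed(total_bits, bits_per_cell):
    """Number of memory cells required to hold total_bits."""
    return math.ceil(total_bits / bits_per_cell)


def levels_needed(bits_per_cell):
    """Distinguishable polarization states each cell must support."""
    return 2 ** bits_per_cell


def encode(bits, bits_per_cell):
    """Split a bit string into per-cell state indices.

    Each index lies between 0 and 2**bits_per_cell - 1. How an index maps
    to a domain configuration is device-specific and not modelled here.
    """
    if len(bits) % bits_per_cell:
        raise ValueError("bit string length must be a multiple of bits_per_cell")
    return [
        int(bits[i:i + bits_per_cell], 2)
        for i in range(0, len(bits), bits_per_cell)
    ]
```

For the 36-bit example of Figure 1, `cells_needed(36, 1)` gives 36 and `cells_needed(36, 3)` gives 12, while `levels_needed(3)` gives the eight states written by the authors.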


A.M. PANI 


The storage density of today’s FERAMs is 
tenfold lower than that of Flash®, but exploit- 
ing ferroelectric domains could greatly 
improve it. Along with their excellent power 
consumption, endurance and write times, 
such improved storage density would put 
FERAMs back in the race towards a universal 
memory. In addition, the use of multidomain 
structures to encode intermediate polarization 
states could be applied to other devices based 
on ferroelectric layers, such as ferroelectric 
transistors’, tunnel junctions® or switchable 


photovoltaics’. Lee and colleagues’ work also 
emphasizes how a deep understanding of the 
physical processes underlying information 
storage allows even the highest technological 
hurdles to be overcome. 
Broken giant linked 
to heart failure 


Genetic mutations can cause a type of heart disease called dilated cardiomyopathy, 
by predisposing the organ to enlarge and function poorly. It has now been found 
that 27% of cases are due to mutations that disrupt the muscle protein titin. 


ELIZABETH M. MCNALLY 


Cardiomyopathy, a disorder of heart 
muscle, is a leading cause of conges- 
tive heart failure, in which the organ 
cannot pump enough blood to meet the 
body’s demands. Mutations account for many 
cardiomyopathy cases, but most of these gene 
variants are rare — often described from an 
individual or a single family’. In particular, 
mutations associated with a form of the dis- 
ease known as dilated cardiomyopathy have 
been identified in about 50 different genes, 
and the gene most commonly found to be 
mutated until now (LMNA) explains only 
5% of cases. In a landmark study published 
in the New England Journal of Medicine, 
Herman et al.” report that as many as 27% of cases, 
tested using modern sequencing techniques, 
can be attributed to dominant mutations in 
the TTN gene, which encodes titin, the largest 
human protein. 

Titin is found in the sarcomeres, the con- 
tractile protein structures in heart muscle 
cells (Fig. 1). The protein is considered to be 
a molecular ruler that monitors the length 
of the sarcomere during muscle contraction 
and relaxation, and transmits this informa- 
tion to the rest of the cell. Consistent with this 
role, the activity of a carboxy-terminal kinase 
domain in titin is stimulated by muscle activity 
and stretch’. 

Mutations in TTN have previously been 
linked* to some cases of dilated cardiomyo- 
pathy (DCM), but comprehensive analyses 
of the gene and its mutations have been ham- 
pered by its complexity and exceptionally 
large size. TTN contains nearly 350 exons 


(protein-coding regions), which encode a pro- 
tein of about 35,000 amino acids, with many 
repeated sequence motifs. The arrival of faster 
and cheaper sequencing technologies now 
makes such analyses feasible. 

To explore the link between TTN and 
DCM, Herman and colleagues” used an ‘exon- 
capture’ technique, in which they isolated and 
sequenced TTN exons from 312 patients with 
DCM, 231 patients with hypertrophic cardio- 
myopathy (HCM) and 249 control subjects 
without cardiomyopathy. The authors used 
two approaches for gene sequencing: massively 
parallel (or ‘next-generation’) sequencing, and 
traditional Sanger sequencing. They found 
that 17% of the DCM patients had mutations 
that are likely to produce truncated forms of 
titin. This frequency increased to 27% for 
those DCM patients whose DNA was evalu- 
ated using massively parallel sequencing. 
By contrast, only about 1% of HCM cases 
and 3% of the control subjects had protein- 
truncating mutations. What’s more, TTN 
mutations and DCM were co-inherited in 
those individuals whose families participated 
in the study. Taken together, these results 
clearly support the idea that mutations in TTN 
have a key role in DCM. 

The TTN variants observed by the authors 
were not randomly distributed along the 
length of the gene: they were more frequent 
in an area coding for a titin region that inter- 
acts with the protein myosin in the sarcomere 
(Fig. 1). In particular, many of these muta- 
tions would disrupt titin’s kinase domain, 
probably impairing the ability of the pro- 
tein to signal to the cell that the sarcomere 
has lengthened abnormally*. The inability 
to sense or signal when the sarcomere has 
elongated too far may render heart cells 




Figure 1 | A molecular ruler. Muscle fibres and heart cells are composed of repeated contractile units 
called sarcomeres that are made up of actin and myosin protein filaments, together with other proteins 
such as titin. Actin filaments are anchored to non-contractile protein structures at the borders of the 
sarcomere, known as ‘Z bands’. Titin spans half the length of the sarcomere and is thought to act as a 
molecular ruler that senses changes in sarcomere length during muscle contraction and elongation. 
Herman et al.’ report that 27% of patients with dilated cardiomyopathy, a disorder of heart muscle, 
have protein-truncating mutations in the titin-encoding gene. Many of these mutations would lead to 
disruption of a protein kinase domain (TK) in the protein, which would in turn result in an impaired 
ability to monitor sarcomere lengthening and, hence, in a malfunction of heart muscle. 
susceptible to damage from muscle activity. 
Some of these mutations may also disrupt tit- 
in’s ability to recoil during muscle contraction’. 

In their analysis, Herman et al.” considered 
only rare, protein-truncating TTN variants 
(occurring with a frequency of less than 1%). 
So, how these mutations interact with the more 
common, non-truncating TTN mutations 
(such as those that change a single amino acid), 
or with cardiomyopathy-associated vari- 
ants of other genes remains to be evaluated. 
Modelling the cumulative effect of multiple 
gene variants on the heart’s physiology will 
require more sophisticated analyses. 

The authors’ data suggest that massively 
parallel sequencing is more sensitive than 
Sanger sequencing in detecting TTN variants. 
A reason for this could be that this type of 
sequencing determines a given sequence many 
more times than the older technique. This 
repeated sequencing, or possibly the analyses 
used to align the resulting sequences, may be 
more suited to the repetitive nature of TTN. 
In the immediate future, DNA microarrays 
designed to capture specific exons on the 
basis of bioinformatic identification’ could be 
used to analyse TTN variants. However, such 
microarrays should be carefully evaluated for 
their use in DCM diagnostics, as the bounda- 
ries of some of the TTN exons are incompletely 
defined in the most commonly used databases 
(those of the National Center for Biotechnology 
Information in Bethesda, Maryland, and the 
University of California, Santa Cruz). Alter- 
native approaches involving whole-genome 
sequencing might therefore be favoured for 
DCM diagnostics. 

The actual outcome of DCM-related muta- 
tions, however, depends on the rest of an 
individual’s genetic make-up and on envir- 
onmental factors. For example, the effects of 
protein-disrupting TTN variants would be 
expected to be worse when combined with 
factors such as dysfunctional heart valves or 
hypertension. Interestingly, in the present 
study’, men with TTN mutations had more 
severe DCM than women. Male hearts may be 
more sensitive to TTN mutations because they 
are normally larger than female hearts, and a 
larger heart faces greater strain and pressure. 

Herman and colleagues’ observation 
that a few of the control individuals carry 
titin-disrupting gene variants indicates that 
there are likely to be additional genetic or 
environmental factors that enhance the dis- 
ease-causing potential of these mutations. 
Alternatively, these symptomless individu- 
als might carry other gene variants that 
actively suppress the effects of the TTN muta- 
tions. For example, gene variants that drive 
increased degradation of the mutated TTN 
messenger RNA, or of the truncated titin, could 
reduce the amount of the damaged protein in 
the heart, and thereby limit the negative effects 
of TTN mutations. So, for a small percentage of 
people, life with less titin may not be too bad. 


Diagnosing the genetic basis of DCM 
provides useful information that could guide 
therapy, especially when there is a risk of 
life-threatening — but treatable — irregu- 
lar heart rhythms*. Most diagnostic testing 
now relies on sequencing many genes at once, 
as there are few clinical clues that reduce 
the number of candidate genes. However, 
given the frequency of titin-disrupting vari- 
ation reported by Herman and colleagues, 
mutations in TTN now emerge as primary 
causes of, or highly potent contributors to, 
cardiomyopathy. 
A duo of graphene 
mimics 


The synthesis of analogues of graphene by two different means provides insight 
into the origins of massless particles and paves the way for studies of materials 
with exotic topological properties. SEE LETTERS P.302 & P.306 


JONATHAN SIMON & MARKUS GREINER 


When a particle is constrained to move 
in a honeycomb-lattice structure, 
its properties change dramatically: 
it behaves as though it has no mass and travels 
at the speed of light. Such particles, known as 
massless Dirac fermions, were first observed'* 
in atomic graphene in work that spawned a 
frenzy of research into its properties, both to 
understand the fundamental science and for 
technological applications. In a pair of letters 
published in this issue, Gomes et al.’ (page 306) 
and Tarruell et al.’ (page 302) describe how, 
for the first time, they have created synthetic 
analogues of graphene in two different systems. 
Atomic graphene is a single layer of carbon 
atoms organized into a honeycomb struc- 
ture. The new graphene analogues³,⁴ share 
this honeycomb topology and offer substan- 
tial advantages in manipulation and readout 
of material properties over what is possible 
with atomic graphene. Gomes et al.’ assem- 
bled ‘molecular graphene’ from individually 
placed carbon monoxide (CO) molecules, and 
studied the bizarre effects that arise from small 
variations in the material’s lattice structure 
(Fig. 1a,b). Tarruell et al.* show that a clever 
arrangement of laser beams (an optical lattice) 
produces a honeycomb structure to confine an 
ultracold gas of potassium atoms, and inves- 
tigated how the lattice structure controls the 
mass of the atoms (Fig. 1c,d). 
The first step in studying a graphene ana- 
logue is to show that its particles are massless. 
In everyday life, an object’s mass is measured 
by weighing it on a scale. That is impossible for 
an extremely light object such as an electron, 
particularly when its mass is inextricably tied 
to the material in which it resides. Gomes and 
colleagues’ circumvented this difficulty by 
studying the energy gap between the valence 
and conduction energy bands in their molecu- 
lar graphene. This quantity reflects the energy 
required to create an electron-hole pair out of 
the vacuum (where a hole is a charge carrier 
created by the absence of an electron), and thus 
the electron’s mass. The authors measured the 
gap using scanning tunnelling microscopy 
and spectroscopy, and observed no separation 
between the valence and conduction bands — 
only a sharp dip at the Dirac point, at which 
the bands touch. The dip indicates a vanish- 
ing number of quantum states at zero energy, 
and the absence of a gap is a clear signature of 
massless Dirac fermions. 
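
The link between gap and mass can be made explicit with the standard energy-momentum relation for a Dirac particle in a lattice. The symbols are assumptions introduced here and are not used by the authors: v is the effective velocity that plays the role of the speed of light, m is the effective mass and k is the wavevector measured from the Dirac point.

```latex
E_\pm(\mathbf{k}) = \pm\sqrt{(\hbar v |\mathbf{k}|)^2 + (m v^2)^2},
\qquad
\Delta = E_+(0) - E_-(0) = 2 m v^2 .
```

Setting m = 0 gives the linear, gapless spectrum E = ±ħv|k|, in which the two bands touch at a single point, so an unmeasurably small gap corresponds to a vanishing mass.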

Having established the existence of massless 
electrons in their molecular graphene, Gomes 
and colleagues employed their exquisite con- 
trol of the material to study how slight modi- 
fications to the lattice geometry change the 
properties of its resident electrons. First, they 
introduced a periodic arrangement known as 
a Kekulé structure to reimbue their massless 
Dirac fermions with mass, and detected this 
mass through the appearance of an energy 


gap in the material’s electron-energy spec- 
trum. They then showed that a distortion of 
the lattice structure, akin to squeezing it along 
several axes, makes the electrons act as if they 
are in a magnetic field’. The appearance of an 
isolated, zero-energy quantum state in the 
presence of this apparent field serves as striking 
confirmation of the existence of massless Dirac 
fermions in the molecular graphene. 

Meanwhile, Tarruell and colleagues’ demon- 
strated that the potassium atoms in their 
material behave as massless Dirac fermions by 
watching them undergo a transition between 
the conduction and valence bands. Such transi- 
tions require substantial energy input for parti- 
cles that have mass, to overcome the particles’ 
rest mass and bridge the energy gap. But for a 
massless particle there is no energy gap, so the 
particle can move freely between valence and 
conduction bands at the Dirac point. 

Holding a massless Dirac fermion near a 
Dirac point in order to observe transitions 
between bands would be extremely chal- 
lenging in the solid state, because motional 
damping is difficult to control, and scattering 
off lattice impurities and phonons (quasi- 
particles associated with lattice vibrations) 
regularly randomizes the Dirac fermions’ 
momentum. Ultracold atomic gases in optical 
lattices are ideally suited to overcoming this 
problem, as the atoms move without dissipa- 
tion in extremely clean and tightly controlled 
environments®. 

Tarruell and co-workers took advantage 
of these features in their work. They started 
with a cloud of fermionic (half-integer spin) 
potassium atoms in the valence band. These 
atoms exhibited a spread in momentum due 
to the Pauli exclusion principle, according to 
which two or more fermionic particles cannot 
occupy the same quantum state. The authors 
then subjected the atomic cloud to a magnetic- 
field gradient that gently accelerated the atoms, 
slowly varying their momentum. In this set- 
ting, any atom whose momentum trajectory 
crosses a Dirac point should be transferred to 
the conduction band (Fig. 1d). 

Tarruell et al. elegantly observed the existence 
of Dirac points by imaging the momentum 
distribution of their cloud. They achieved 
this by allowing the cloud to expand and then 
taking an image of its spatial distribution, 
noting that atoms in the cloud that have more 
momentum move more quickly in this expan- 
sion. Before the acceleration, the atoms are 
distributed smoothly around zero momentum, 
whereas after acceleration the distribution 
has gaps corresponding to atoms that have 
been transported through a Dirac point, and 
hence transferred to the conduction band. 

To address the question of how the Dirac 
fermions arise, Tarruell and colleagues 
smoothly tuned the energy potential of their 
structure to transform the underlying honey- 
comb lattice, which has two Dirac points, into 
a dimer lattice, which has none. To probe for 
Figure 1 | Two approaches to making graphene analogues. a, Gomes et al.* assembled molecular 
graphene using a scanning tunnelling microscope (STM), the tip of which is made of iridium atoms, to 
individually position (blue arrow) carbon monoxide (CO) molecules in a hexagonal pattern on a copper 
substrate. b, STM image of the partially assembled graphene. Electrons are repelled from the hexagonally 
arranged CO molecules and forced to move on a honeycomb grid (black lines). The blue arrow indicates 
the path of a CO molecule as it is positioned. c, Tarruell et al.* generated a honeycomb optical lattice 
for confining ultracold potassium atoms through a combination of two interfering laser beams (red) 
and a third separate beam (blue), all retro-reflected by mirrors. The resulting intensity pattern is shown 
overlaid, with the intensity maxima shown in white and the tunnelling pathways between maxima 
emphasized by red lines. d, To observe the existence of Dirac fermions, the authors accelerated potassium 
atoms in the material's valence energy band and found that those that pass through a Dirac point are 
transferred to the conduction band. The plot displays the electron energy (E) as a function of its momenta 
in the x and y directions (q_x and q_y) for both bands. The arrows denote the paths of the potassium atoms 
through momentum space. 


the existence of Dirac points in each lattice, 
they accelerated the potassium atoms and 
measured the fraction of atoms transferred 
to the conduction band. What they find is an 
impressive verification of a theoretically pre- 
dicted topological phase transition’ *: the atom 
fraction transferred drops abruptly to zero at 
precisely the point at which the dimer lattice 
forms and the two Dirac points merge and 
annihilate one another. 
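
The merging of Dirac points can be illustrated with the textbook nearest-neighbour tight-binding model of a honeycomb lattice, in which one of the three hopping amplitudes is made stronger than the other two. This is a sketch under stated assumptions, not the authors' calculation: the names `t_strong`, `t_weak` and `min_gap` and the grid size are hypothetical, and the model ignores the details of the optical potential. The two bands have energies ±|f|, so they touch wherever f vanishes.

```python
import cmath
import math


def min_gap(t_strong, t_weak=1.0, n=120):
    """Smallest band gap, 2 * min|f|, over an n x n grid of Bloch phases.

    f = t_strong + t_weak * (exp(i*th1) + exp(i*th2)) is the off-diagonal
    element of the two-band tight-binding Hamiltonian (an assumed model).
    A gap of zero means the bands touch at Dirac points.
    """
    phases = [2.0 * math.pi * i / n for i in range(n)]
    smallest = float("inf")
    for th1 in phases:
        for th2 in phases:
            f = t_strong + t_weak * (cmath.exp(1j * th1) + cmath.exp(1j * th2))
            smallest = min(smallest, abs(f))
    return 2.0 * smallest


if __name__ == "__main__":
    # Scan from the symmetric honeycomb lattice towards the dimer limit.
    for ratio in (1.0, 1.5, 2.0, 2.5, 3.0):
        print(ratio, min_gap(ratio))
```

In this model the bands touch as long as `t_strong` does not exceed twice `t_weak`. Beyond that point the two zeros of f have merged, and a gap of 2(`t_strong` − 2 `t_weak`) opens, which is the tight-binding counterpart of the transition to a dimer lattice. On a finite grid the computed gap at intermediate ratios is only approximately zero.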

These studies³,⁴ pave the way for a new realm 
of condensed-matter physics, in which mate- 
rials with exotic topological properties can be 
built to order from the ground up. However, 
some of the most interesting properties arise 
when interactions between particles cause 
them to self-organize into intricate patterns 
determined by the laws of quantum mechanics. 
To this end, it will be essential to suppress the 
role of the copper substrate underlying Gomes 
and colleagues’ material, because it screens out 
the repulsive Coulomb interaction between the 
electrons and limits the electrons’ lifetime. 

By contrast, the ultracold atoms described 
by Tarruell et al.* are extremely long lived and 
can exhibit strong interactions, but so far have 


been studied only in synthetic magnetic fields”® 
that are not strong enough to investigate the 
physics of highly correlated Dirac fermions. 
Much stronger fields should be achievable 
through temporal modulation of the Dirac 
points’’. Once these limitations are overcome, 
the techniques developed by Tarruell, Gomes 
and their colleagues should point the way to a 
new generation of quantum materials. 
How brains learn to 
control machines 


After training, animals and humans can make their thoughts interact directly 
with computers. A study provides evidence that the corticostriatal system of the 
brain is essential for this learning process. SEE LETTER P.331 


DAVID T. BLAKE 


Brain–machine interfaces have a rich 
history in the sci-fi genre: in The Matrix 
films, human brains are plugged into a 
computer-based simulation that then becomes 
their ‘reality’. But using our thoughts to directly 
control computers or other devices is not just 
in the realm of fantasy. Monkeys can learn to 
use visual cues to instruct a brain–machine 
interface to move a robotic arm or a com- 
puter cursor’’. And electrode arrays were 
implanted into the brain of a paralysed man in 
2006, enabling him to control an artificial arm, 
to move a cursor on a computer screen and 
even to open e-mail’. Over time, an individual 
learns to improve their control over the brain- 
machine interface by modifying the activity of 
their brain, but how this happens is not well 
understood. On page 331 of this issue, Koralek 
et al.* report that the corticostriatal system of 
the brain is involved in learning mental actions 
and skills that do not involve physical move- 
ment, such as those required for control of 
brain–machine interfaces*. 

The corticostriatal system has a unique pat- 
tern of connectivity that enables sensory inputs 
to be associated with appropriate motor or 
cognitive responses’. It consists of a cortical 
component, the primary motor cortex, that 
exerts control over muscles, and a striatal 
component, the basal ganglia, that receives 
direct inputs from the motor cortex. The basal 
ganglia are involved in a wide range of learning 
conditions and are crucial to the motor deficits 
observed in Parkinson’s and Huntington's dis- 
eases. Both corticostriatal components have a 
role in the learning and execution of physical 
skills requiring movement. 

It was known that the learning of abstract 
(non-physical) skills, such as controlling a 
brain-machine interface, required the motor 
cortex, but it was not clear whether the striatal 
component was also involved. To address this 


*This article and the paper⁴ under discussion were 
published online on 4 March 2012. 


issue, Koralek and colleagues* implanted elec- 
trodes in the cortex and striatum of live rats to 
record neuronal activity. The authors then set 
up a brain-machine interface (Fig. 1) in which 
a tonal sound with a specific pitch was auto- 
matically played to the animal depending on 
the measured activity of two groups of corti- 
cal neurons. The pitch of the sound increased 
if the first group of neurons was more active 
than the second group, and decreased if the 
second group was the more active. If the pitch 
got high enough, the rat earned a sugar-water 
reward; if the pitch dropped enough, then the 
animal received a food pellet. 
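
The feedback rule described above can be illustrated with a minimal sketch. The gain, starting pitch and reward thresholds below are hypothetical values chosen for illustration only; the text does not state the numbers used in the experiment, and the function names are not taken from the paper.

```python
def update_pitch(pitch_hz, rate_group1, rate_group2, gain=50.0):
    """Raise the pitch when group 1 is more active, lower it when group 2 is.

    gain (Hz per unit of rate difference) is a hypothetical parameter.
    """
    return pitch_hz + gain * (rate_group1 - rate_group2)


def outcome(pitch_hz, high_target=15000.0, low_target=1000.0):
    """Return the reward earned at this pitch, or None if no target is reached.

    The two thresholds are illustrative assumptions.
    """
    if pitch_hz >= high_target:
        return "sugar water"
    if pitch_hz <= low_target:
        return "food pellet"
    return None


def run_trial(samples, start_hz=8000.0):
    """Step through (rate_group1, rate_group2) samples until a reward is earned."""
    pitch = start_hz
    for rate1, rate2 in samples:
        pitch = update_pitch(pitch, rate1, rate2)
        reward = outcome(pitch)
        if reward is not None:
            return reward, pitch
    return None, pitch
```

In this sketch, a trial in which the first group stays more active drives the pitch up to the sugar-water target, and the reverse pattern drives it down to the food-pellet target; equal activity leaves the pitch unchanged.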

Over a two-week period, the rats achieved a 
high level of proficiency at earning each reward. 
The researchers carried out a similar experi- 
ment with another set of animals, with the 
difference that no sound was played. In this 
case, the rats did not learn to alter activation 
of the two groups of neurons to earn rewards, 


which confirmed the necessity of the auditory 
feedback. The animals that did learn developed 
a synchrony between the activity of their motor 
cortex and that of their striatum during task 
performance that was absent before learning. In 
a sense, the task learning caused the cortex and 
striatum to work in harmony. This synchrony 
is perhaps not entirely surprising, as it is known 
that the striatum and cortex are active and display
similar temporal patterns of activity in a
wide range of motor and cognitive procedures’. 

Were the rats controlling the brain-machine 
interface in an intentional and goal-directed 
fashion? In a human brain-machine interface, 
we would simply ask the subject if they had a 
conscious knowledge that they were controlling 
an external interface. In the case of laboratory 
animals, such direct feedback is not possible, 
but Koralek et al. got part of the answer through 
clever experimental design. The authors gave 
sugar water to the rats before training, so that 
the animals were only motivated to get the food 
pellet afterwards. However, if the task perfor- 
mance were habitual and unintentional, the 
rats would not know how to earn one specific 
reward and not the other. What happened was 
that the animals did learn how to get the food 
pellet and not the sugar water, and this was 
associated with the change in brain activity pre- 
dicted for goal-directed, intentional behaviour. 

To assess the necessity for the striatum 
in the learning task, Koralek et al.* repeated 
their experiments using genetically modified 
mice’ in which NMDA receptors were spe- 
cifically inactivated in the striatum. NMDA 
receptors are required for normal functioning 
of the cortex-striatum connections, and so 
the genetically modified mice should have a 
greatly reduced ability to coordinate the two 
brain regions. Indeed, the authors observed 
that the mice could not learn how to achieve 
Figure 1 | Music to my brain. Koralek et al.’ implanted electrodes into the brains of live rats to record the
neuronal activity of the motor cortex and the striatum. Depending on certain features of motor-cortex 
activity, a tonal sound with a specific pitch was automatically generated. The rats learnt to make the pitch 
of the sound rise or fall by modifying their brain activity, as they were rewarded with either sugar water
or food pellets, respectively, if the pitch changed enough. By using genetically modified mice in similar
experiments, the authors showed that activity of the striatum was required for the animals to learn the task. 
the rewards, which supports the hypothesis
that developing coordination between the 
striatum and the cortex is necessary for 
abstract learning. 

Mechanistic studies of learning are inher- 
ently difficult. Most experiments sample 
activity before and after some type of learn- 
ing and make inferences about what has 
happened in the interim. Others focus on 
certain tasks that can be learned in less than 
an hour or so, because this is the time frame 
during which stable recordings from single 
neurons can be reliably obtained. Implant 
technologies such as brain-machine interfaces 
have opened up new possibilities, because they 
allow researchers to record neuronal activity 
from a particular brain location over long 
periods of time. By resampling these loca- 
tions in rats and mice throughout learning, 
Koralek et al. document that the animals can 
learn abstract skills by controlling neuronal 


BIOCHEMISTRY 


activity at multiple, specific, predetermined 
locations in the cerebral cortex, and that the 
neural basis for this learning also occurs in 
the cortex-striatum interplay. This find- 
ing augments the literature that inseparably 
ties motor functions and complex cognitive 
functions to the interconnected networks 
associated with the brain's frontal lobe. 

Half a century ago, no one would have 
believed that it would be possible to implant 
a cardiac pacemaker into a person's chest that 
would restart their heart automatically. Today, 
these electrophysiological devices are com- 
mon, as are cochlear implants for improved 
hearing, and as will soon be the case for retinal 
implants that will give at least part of their lost 
sight back to some blind people*”. Preliminary 
trials with brain-machine interfaces in humans 
are now in progress’, but are hampered by the 
limited stability of the primary link between 
the brain and the recording electrode. If the 


Favouring the unfavoured


The naturally occurring antibiotic lasalocid A contains a chemical structure that 
is not expected to form readily. The enzyme that catalyses the formation of this 
structure has been identified, and its activity is a revelation. SEE LETTER P.355 


DAVID E. CANE 


Lasalocid A and other polyether antibiotics
are widely used to control parasitic infec-
tions in farm animals, and to enhance
feed efficiency in cattle and other ruminants’.
They act by binding and carrying alkali-metal
cations and protons across bacterial mem-
branes, thereby interfering with essential ion
gradients. The polyethers have also attracted
considerable attention from scientists who
want to identify the biosynthetic origins of
these structurally complex substances and
unravel the intricate pathways by which they
are assembled’. On page 355 of this issue,
Hotta et al.’ report the intriguing mechanism
of a step in the biosynthesis of lasalocid A,
which they worked out using a combination
of synthetic organic chemistry, enzymol-
ogy, protein structural biology and quantum
mechanics. Their findings provide insight into
the molecular basis for the products formed by
this fascinating class of enzymes.

Polyether molecules are characterized by
a complex array of two or more cyclic ethers
— oxygen-containing rings that, in polyether
antibiotics, are typically composed of either
five or six atoms. The carbon backbones of
polyethers are assembled by enzymes known
as polyketide synthases from simple build-
ing blocks such as acetate, propionate and
butyrate, which contain two, three and four 
carbon atoms, respectively. 

A general model for the biological forma- 
tion of polyethers was formulated’ in 1983. 
It postulates that intermediates called epox- 
ides (cyclic ethers consisting of three atoms) 
undergo a cascade of enzyme-catalysed 
reactions to form the polyether rings. The 
energetics for this process resembles that of the 
gravity-driven tumbling of a line of dominoes:
the enzyme triggers the reaction cascade, just 
as the fall of the first domino causes the other 
dominoes to topple. In the case of polyethers, 
however, the enzyme also controls the number 
and size of the ether rings that form, which is 
analogous to steering the tumbling dominoes 
to control precisely where each one lands. 

In the past decade, considerable progress 
has been made in elucidating the steps of poly- 
ether biosynthesis. The enabling advance was 
the identification”** of bacterial gene clusters 
that control the formation of several important 
polyethers. Each gene in these clusters encodes 
an enzyme that catalyses a specific step in the 
biosynthetic pathway. Selectively knocking 
out some of the genes, to interrupt the path- 
way when the remaining genes in the cluster 
link can be stabilized for longer durations, these
interfaces may one day become as common as
other electrophysiological devices and provide
us with similarly useful services.

are expressed in cells, has enabled the func-
tions of the deleted genes to be established. As
a result of such deletions, several compounds 
that accumulate have been identified as key 
intermediates in the pathway, including the 
unsaturated polyketides that are initially 
generated from the acetate, propionate and 
butyrate building blocks. 

Similar studies have also begun to clarify 
the final stages of polyether biosynthesis. For 
example, oxidative enzymes that generate the 
key epoxide intermediates (known as poly- 
epoxides) from unsaturated polyketides have 
been discovered, as have the epoxide hydrolase 
enzymes that catalyse the reaction cascade in 
which the polyepoxides are converted into 
polyethers”. A group led by Hideaki Oikawa 
(one of the authors of the new paper’) recently 
expressed and purified’ an intriguing epoxide 
hydrolase, Lsd19. The team established that 
this enzyme is responsible for converting a pre- 
cursor molecule harbouring two epoxide rings 
into a mixture of two closely related polyethers 
— lasalocid A and isolasalocid A (Fig. 1). 
Both compounds contain two cyclic ethers, 
but the second of these in lasalocid A is a six- 
membered ring, whereas in isolasalocid A it is 
a five-membered ring. 

The authors went on to show’ that Lsd19 
consists of two halves — Lsd19A and Lsd19B 
— that have similar amino-acid sequences. 
They found that Lsd19A catalyses the forma- 
tion of the first ether and Lsd19B controls the 
formation of the second. The team also iden- 
tified acidic and basic amino-acid residues at 
the active sites of each domain, and reported 
evidence implicating these residues in the 
catalysis of the reactions that convert their 
epoxide substrates to cyclic ethers. 

Hotta et al.’ now report the high-resolution 
(1.59-ångström) X-ray crystal structure of
Lsd19, in which a synthetic, uncyclized sub- 
strate analogue is bound to the half of the 


Figure 1 | Polyether antibiotics. The naturally
occurring antibiotic lasalocid A is biosynthesized
as a mixture with its isomer, isolasalocid A.
The structures differ in that the terminal cyclic
ether (red) in lasalocid A consists of six atoms,
whereas the equivalent ether in isolasalocid A
(blue) has five. Hotta et al.’ report the mechanism
of formation of the six-membered ether in
lasalocid A.


protein corresponding to Lsd19A and a syn- 
thetic product analogue carrying two ether 
rings is bound to the active site of the Lsd19B 
domain. The structure clearly shows that the 
active-site cavities of each half of the protein 
have very similar overall architectures and 
general catalytic features, but different depths. 
The consequent differences in active-site shape 
and volume can reasonably account for the dis- 
tinct role of each domain in catalysing different 
stages of the two-step, ether-forming reaction 
cascade. The high-resolution structure also 
reveals candidates for the catalytically impor- 
tant acidic and basic residues in the active sites 
of each protein subunit. 

The authors then probed the enzyme in even 
more depth by carrying out theoretical studies 
to model the reactions catalysed by Lsd19B, 
using quantum-mechanical calculations that 
incorporated the experimentally determined 
positions of the catalytic residues in the active 
site of the Lsd19B domain. In particular, they 
wanted to learn how the enzyme preferentially 
catalyses the formation of a six-membered 
cyclic ether, given that a five-membered ring 
would normally form more easily in the non- 
enzyme-catalysed reaction (according to well- 
established rules’ for predicting the outcomes 
of ring-formation reactions in solution). 

Hotta et al. began by modelling ether-ring 
formation in the presence of either a simple 
acid or a base, but in the absence of an 
enzyme. As expected, the theoretical model 
predicted that the five-membered ring forms 
more quickly than the six-membered ring, 
although the preference is smaller in the 
case of the base-catalysed reaction. Despite 
this difference in formation rate, the authors 
concluded that the product containing the 


six-membered ring is thermodynamically 
more stable than that containing the five- 
membered ring, even though it is more 
difficult to obtain (slower to form). 

By contrast, when they modelled ether-ring 
formation in the presence of the enzyme, the 
calculations showed that formation of the 
six-membered ether was favoured. This is a 
chemical process in which an aspartate amino 
acid acts as a catalytic base, and two other 
amino-acid residues act as acids to stabilize 
the transition state of the reaction. The results 
show that when enzymes have the choice of 
catalysing two competing chemical reactions, 
they can evolve to preferentially accelerate the 
intrinsically slower transformation, until the 
otherwise less-preferred product becomes 
dominant. In other words, enzyme catalysts 
can channel chemical ‘dominoes’ to fall in a
different direction from that favoured in the 
absence of the enzyme. 
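
The link between relative rates and product preference can be made concrete with a short calculation. This is a minimal sketch that assumes simple transition-state theory with equal pre-exponential factors for the two competing ring closures, so that the rate ratio depends only on the difference in activation free energy. The function names are illustrative, and any barrier differences passed in are hypothetical inputs; none are taken from Hotta et al.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # gas constant in kJ/(mol K)


def rate_ratio(ddg_kj_per_mol, temperature_k=298.15):
    """Return k_six / k_five for a given barrier difference.

    ddg_kj_per_mol is the activation free energy of six-membered ring
    formation minus that of five-membered ring formation. A positive value
    means the six-membered ring forms more slowly.
    """
    return math.exp(-ddg_kj_per_mol / (R_KJ_PER_MOL_K * temperature_k))


def fraction_six_membered(ddg_kj_per_mol, temperature_k=298.15):
    """Fraction of six-membered product under purely kinetic control."""
    ratio = rate_ratio(ddg_kj_per_mol, temperature_k)
    return ratio / (1.0 + ratio)
```

In these terms, the uncatalysed reaction corresponds to a positive barrier difference (five-membered ring favoured), and the enzyme's effect is to make that difference negative by selectively stabilizing the transition state that leads to the six-membered ring.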

This impressive study solidifies and extends 
earlier hard-won advances in the study of 
polyether biosynthesis, and has implications 
for the formation of many other polyethers. 
Still unanswered is how the latest findings 
might apply to the biosynthesis of polyethers 
that have more than two ether rings, and to 
the intriguing ladder-like polyethers found
in toxins made by marine organisms. A com- 
plete picture of polyether biosynthesis will 
also require a better understanding of how the 
oxidative enzymes involved in earlier stages 
of polyether biosynthesis control the three- 
dimensional structure of the polyepoxides that 
serve as substrates for the intricate cascade of 
ether-forming reactions.
Transcription initiation unwrapped


A genome-wide, high-resolution study of DNA-binding sites for proteins that 
transcribe DNA into RNA reveals details about how this process occurs in vivo. 


SEE ARTICLE P.295 


STEPHEN BURATOWSKI 


Decades of research using purified
molecules in vitro have produced a
basic understanding of the enzymes 
and mechanisms that contribute to gene 
expression in eukaryotes (organisms such as 
animals, plants and fungi). But these results 
must be confirmed in living cells, and a pro- 
ductive approach has been to use chromatin 
immunoprecipitation. This technique reveals 
the genomic locations of DNA-binding pro- 
teins such as those forming nucleosomes — 
DNA segments wrapped around a histone 
protein core — and transcription factors. 
Although these studies have produced a wealth 
of data, they have not always provided mecha- 
nistic insight. However, a paper by Rhee and 
Pugh¹ on page 295 of this issue sheds light on
several questions concerning the initiation of 
DNA's transcription into RNA.
In chromatin immunoprecipitation (ChIP), 
cells are first treated with a chemical that
crosslinks proteins and DNA. The cells are 
then disrupted so that their DNA is frag- 
mented. By using specific antibodies, a protein 
of interest is isolated together with any bound 
DNA pieces, and these can then be analysed 
by high-throughput DNA sequencing or other 
techniques. This allows a quick identification 
of all the binding sites for a protein, such as 
those that form protein complexes for tran- 
scription initiation, across an entire genome. 
Rhee and Pugh have previously reported” 
a modification of ChIP, called ChIP-exo, in 
which an enzyme removes all DNA except that 
closest to the protein-DNA crosslink, mark- 
edly improving the technique’s resolution to a 
few DNA nucleotides. The results can be quite 
striking, as illustrated by the excellent correla- 
tion that the authors observe in their present 
study’ between the ChIP-exo crosslink sites for 
the transcription factors TBP (TATA-binding 
protein) and TFIIB, and the protein-DNA 


contacts seen in their crystal structures. 
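
The gain in resolution can be pictured with a small sketch. It assumes that the exonuclease digests each DNA strand in the 5′→3′ direction until it is blocked at the crosslink, so that the 5′ ends of sequenced reads pile up at the left border of the protein on the forward strand and at the right border on the reverse strand. The read representation, the function names and the use of the most frequent 5′ end are illustrative simplifications, not the authors' actual analysis pipeline.

```python
from collections import Counter


def exo_borders(read_starts):
    """Estimate the borders of a protein-DNA crosslink from read 5' ends.

    read_starts is an iterable of (strand, position) pairs, where strand is
    "+" or "-" and position is the genomic coordinate of the read's 5' end.
    Returns (left_border, right_border), or None if either strand has no reads.
    """
    forward = Counter(pos for strand, pos in read_starts if strand == "+")
    reverse = Counter(pos for strand, pos in read_starts if strand == "-")
    if not forward or not reverse:
        return None
    left = forward.most_common(1)[0][0]   # most frequent forward-strand 5' end
    right = reverse.most_common(1)[0][0]  # most frequent reverse-strand 5' end
    return left, right


def site_midpoint(borders):
    """Midpoint of the protected region, as a single-coordinate site estimate."""
    left, right = borders
    return (left + right) // 2
```

For example, with forward-strand 5′ ends concentrated at one coordinate and reverse-strand 5′ ends concentrated a few tens of bases downstream, the two modes bracket the bound protein to within a few nucleotides.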

The researchers analyse promoters — 
sequences that specify where to begin the 
transcription of DNA into RNA — in cells of 
the yeast Saccharomyces cerevisiae, a model 
eukaryotic organism. Most notable are their 
findings regarding how the enzyme that syn- 
thesizes messenger RNA, called RNA polymer- 
ase (Pol) II, is targeted to promoters. TBP is 
known to recognize the ‘TATA box’ — a specific
DNA sequence found in many promot-
ers — and to position Pol II and its associated 
factors at the transcription start site (TSS). 
However, only some promoters, typically 
those that alternate between repressed and 
highly active states, contain an obvious TATA 
box sequence, which represents the optimal 
TBP-binding site’. 

Surprisingly, Rhee and Pugh’s analysis’ of 
TBP-binding sites in ‘TATA-less’ promoters —
more prevalent among ‘housekeeping’ genes 
that are expressed ubiquitously — reveals this 
to be a misnomer. These promoters do contain
TATA boxes, but their sequences stray from 
the standard sequence by two or more DNA 
bases and so their binding to TBP is weaker. 
This finding echoes those from classic studies* 
on the yeast HIS3 promoter, which contains 
two TATA boxes: a weak one for constitutive 
basal transcription and another, clearly rec- 
ognizable, for maximal, regulated expression. 
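
The idea of a TATA-like element that strays from the standard sequence by a few bases can be illustrated by counting mismatches against a consensus. The sketch below assumes the eight-base consensus TATAWAWR (W = A or T, R = A or G), which is commonly used for yeast but is not stated in this article, and it scans only the forward strand; the function names are hypothetical.

```python
# Subset of IUPAC nucleotide codes needed for the assumed consensus.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "AT", "R": "AG"}

CONSENSUS = "TATAWAWR"  # assumed yeast TATA-box consensus, not from the article


def mismatches(site, consensus=CONSENSUS):
    """Count positions at which site departs from the consensus."""
    if len(site) != len(consensus):
        raise ValueError("site and consensus must have the same length")
    return sum(base not in IUPAC[code]
               for base, code in zip(site.upper(), consensus))


def best_match(sequence, consensus=CONSENSUS):
    """Return (mismatch_count, start_index) of the closest window, or None."""
    width = len(consensus)
    windows = [(mismatches(sequence[i:i + width], consensus), i)
               for i in range(len(sequence) - width + 1)]
    return min(windows) if windows else None
```

Under this scheme, a window with zero mismatches would count as a clearly recognizable TATA box, whereas a window with two or more mismatches would correspond to the weaker, TATA-like elements described above.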

Although all Pol II promoters seem to share 
a common mode of binding to TBP and basal 
factors, Rhee and Pugh’s data' help to explain a 
functional distinction that has been observed” 
between the two classes of promoter. The 
authors report’ that, at promoters with obvious 
TATA sequences, the TSS — and even the ini- 
tiation complex itself — often overlaps with the 
first nucleosome in the transcribed region. The 
expression of genes containing such promoters 
tends to depend on the presence of the SAGA 
protein complex’, which facilitates nucleo- 
some movement, and thus DNA unwrapping, 
by adding acetyl groups to the nucleosome’s 
histones. Therefore, the first nucleosome 
probably represses the genes’ transcription by 
blocking their TSS, and gene activation occurs 
when the histones are removed by targeted 
acetylation. Once the TSS-containing DNA is 
unwrapped, efficient binding of TBP, and Pol II 
and its associated factors, allows the promoter 
to be expressed at very high levels. 

However, most genes have less-obvious 
TATA boxes, and their expression depends on 
other proteins known as TBP-associated factors 
(TAFs), which together with TBP constitute 
TFIID. In vitro studies® have shown that TAFs 
interact with DNA sequences downstream 
of the TATA box, including sequences around 
the TSS. These additional contacts may help 
to compensate for the weaker TBP binding to 
the DNA, but they probably have other func- 
tions. Rhee and Pugh’ find that, at those 
promoters to which TFIID preferentially 
binds, TSSs are located near the upstream 


[Figure 1 labels: nucleosome, DNA, transcription activator, transcription start site]

Figure 1 | Two ways of starting the synthesis of 
messenger RNA. Promoters are sequences within 
genes that specify where DNA's transcription into
RNA starts. a, Some promoters have a clearly 
recognizable sequence, or TATA box, to which 
transcription factor TBP can bind to recruit the 
enzyme RNA polymerase (Pol) II, which synthesizes 
messenger RNA. These promoters can alternate 
between repressed and active states. In the repressed 
state (top), a nucleosome (protein-DNA complex) 
blocks the transcription start site. Transcription 
activator proteins can then recruit additional 
proteins, such as the SAGA complex, to trigger 
nucleosome removal, allowing access to Pol II and 
therefore activating transcription (bottom). b, Most 
promoters lack a clearly recognizable TATA box, 
and their expression depends on the presence of 
transcription factor TFIID, a complex formed by 
TAF proteins and TBP. Transcription activators 
could recruit TAFs, which, in turn, might interact 
with both TBP and a nucleosome to keep the 
transcription start site accessible. 


boundary of the first nucleosome. Therefore, 
TAFs may be positioned in such a way that 
they contact the first nucleosome, preventing it 
from encroaching on the promoter and thereby 
allowing basal gene expression. Indeed, some 
TAFs form a structure resembling the nucleo- 
some histone core’, suggesting that they 
might slot into position within an array of 
nucleosomes. 

In addition to invalidating the concept of 
TATA-less promoters, Rhee and Pugh raise 
questions about two other recent hypoth- 
eses. The first proposes that Pol II promoters 
are intrinsically bidirectional, that is, a single 
TATA box can drive transcription in opposite 
directions. This idea seems plausible because 
TATA sequences are roughly palindromic, and 
transcript sequencing studies*” have shown
that the TSSs of many mRNAs are close to a 
non-coding RNA that is transcribed in the 
opposite direction. However, the authors’ 
ChIP-exo data’ show that the nucleosome- 
depleted regions between these divergent TSSs 
harbour two initiation complexes. In other 
words, bidirectional transcription is the result 
of two overlapping but divergent promoters 
driving transcription in opposite directions, 
rather than a single promoter that can fire in 
both directions. 

The second hypothesis” is that ‘gene loop- 
ing’ — the formation of a physical linkage 
between the beginning and end of active genes 
— is mediated, in part, by TFIIB. This model is 
based on observed interactions between muta- 
tions in genes that encode TFIIB and 3’-end 
processing factors (which modify the end of 
mRNA precursors), as well as ChIP localiza- 
tion of TFIIB (but not the rest of the initiation 
complex) at transcription-termination regions 
of selected genes in yeast'’. However, the pre- 
sent study’ and another genome-wide ChIP 
analysis’ failed to localize TFIIB to 3’ ends, 
except in the context of initiation complexes 
at an adjacent promoter. Therefore, the gen- 
eral role of TFIIB in gene looping needs further 
scrutiny. 

It is worth noting that two transcription 
factors occupying the same genomic location 
in ChIP experiments may not actually be there 
at the same time in the same cell, as this tech- 
nique captures a snapshot of events in a cell 
population. In vitro experiments are therefore 
needed to probe the kinetics and intermediates 
of gene expression. Rhee and Pugh’ use many 
biochemical and structural studies to inform 
the interpretation of their ChIP-exo data; the 
ChIP-exo data, in turn, provide an essential 
in vivo test for in-vitro-derived molecular 
models. This synergism underscores the neces- 
sity of applying both approaches to important 
questions in gene expression.
Ancient deuterostome origins of
vertebrate brain signalling centres 


Ariel M. Pani, Erin E. Mullarkey, Jochanan Aronowicz, Stavroula Assimacopoulos, Elizabeth A. Grove
& Christopher J. Lowe


Neuroectodermal signalling centres induce and pattern many novel vertebrate brain structures but are absent, or 
divergent, in invertebrate chordates. This has led to the idea that signalling-centre genetic programs were first 
assembled in stem vertebrates and potentially drove morphological innovations of the brain. However, this scenario 
presumes that extant cephalochordates accurately represent ancestral chordate characters, which has not been tested 
using close chordate outgroups. Here we report that genetic programs homologous to three vertebrate signalling 
centres—the anterior neural ridge, zona limitans intrathalamica and isthmic organizer—are present in the 
hemichordate Saccoglossus kowalevskii. Fgf8/17/18 (a single gene homologous to vertebrate Fgf8, Fgf17 and Fgf18),
sfrp1/5, hh and wnt1 are expressed in vertebrate-like arrangements in hemichordate ectoderm, and homologous genetic
mechanisms regulate ectodermal patterning in both animals. We propose that these genetic programs were components 
of an unexpectedly complex, ancient genetic regulatory scaffold for deuterostome body patterning that degenerated in 
amphioxus and ascidians, but was retained to pattern divergent structures in hemichordates and vertebrates. 


During vertebrate development, the brain arises from coarsely patterned 
planar neuroectoderm through successive refinement of regional 
identities, resulting after morphogenesis and growth in a complex struc- 
ture composed of highly specialized areas'’. In contrast, invertebrate 
chordates have relatively simple nervous systems that lack unambigu- 
ous homologues of many vertebrate brain regions*®. These clear dis- 
parities in nervous system complexity indicate that key innovations in 
patterning mechanisms have attended the evolution of the vertebrate 
brain from a simpler central nervous system (CNS). However, attempts 
to identify the presumed genetic regulatory novelties have been mainly 
inconclusive, and comparative studies have mostly revealed similarities, 
rather than differences, in the early transcriptional architectures of 
diverse bilaterian nervous systems’. Notable exceptions are CNS sig- 
nalling centres, which have emerged as strong candidates for vertebrate 
genetic regulatory novelties involved in early vertebrate brain evolu- 
tion*°. These centres act as secondary organizers that mediate regional 
patterning in the CNS and are often necessary and sufficient for the 
establishment of vertebrate-specific brain structures’ >>. 

The anterior neural ridge (ANR), zona limitans intrathalamica 
(ZLI) and isthmic organizer (IsO) are the three primary signalling 
centres that direct anteroposterior patterning in the vertebrate 
anterior neural plate and then later in the developing brain.
Homologous signalling centres and their molecular signatures are
absent, or divergent, in amphioxus and ascidians, consistent
with the idea that they are vertebrate novelties whose origins could
have driven CNS innovations in stem vertebrates. However, this
idea depends on amphioxus adequately representing ancestral states 
for chordate developmental genetic characters, and does not account 
for the possibility of secondary losses in cephalochordates. Here we 
present developmental data from the hemichordate S. kowalevskii to 
test an alternative scenario for signalling-centre origins; namely, that 
ANR, ZLI and IsO genetic programs pre-date chordate origins and 


were secondarily simplified or lost along the lineages leading to the 
invertebrate chordates. 

Hemichordates are a deuterostome phylum closely related to 
chordates”® and are a promising outgroup for investigating chordate 
evolution. Previous studies established that despite substantial body- 
plan divergence between the two groups, hemichordates and vertebrates 
share a broadly conserved transcriptional regulatory architecture 
during early body patterning”*””, which is demonstrated by the close 
similarities in spatial arrangements of expression domains of many 
transcription factors that are involved in early bilaterian anteroposterior 
patterning. This combination of morphological divergence and develop- 
mental genetic similarity makes hemichordates an informative out- 
group for testing the proposed coupling of vertebrate morphological 
and developmental genetic innovations. Here, we present descriptive 
and functional evidence that genetic programs homologous to the ANR, 
ZLI and IsO are present in S. kowalevskii, indicating that they were 
elements of an ancient developmental genetic toolkit for deuterostome 
body patterning that were subsequently modified and elaborated in 
stem vertebrates to regulate brain development. 


An ANR-like regulatory program in hemichordates 


The ANR is located in the anterior neural plate of vertebrates and is a 
source of fibroblast growth factors (FGFs) and secreted frizzled-related 
proteins (SFRPs), which establish and pattern the telencephalon.
Topologically consistent with vertebrate CNS expression
domains, sfrp1/5 and fgf8/17/18 are expressed in S. kowalevskii anterior
proboscis ectoderm (Fig. 1a–d). In vertebrates, FGFs and antagonists
of the Wnt pathway mediate ANR function and telencephalon patterning.
In mice, conditional Fgfr1, Fgfr2 and Fgfr3 knockout
abolishes the telencephalon, whereas Fgf8 mutants have a posteriorized
neocortex. Similarly, treating S. kowalevskii embryos with the FGF
receptor inhibitor SU5402 (ref. 34) from late gastrula through to


1Committee on Evolutionary Biology, The University of Chicago, 1025 East 57th Street, Chicago, Illinois 60637, USA. 2Hopkins Marine Station, Department of Biology, Stanford University, 120 Oceanview
Boulevard, Pacific Grove, California 93950, USA. 3Committee on Neurobiology, The University of Chicago, 947 East 58th Street, Chicago, Illinois 60637, USA. 4Department of Organismal Biology and
Anatomy, The University of Chicago, 1027 East 57th Street, Chicago, Illinois 60637, USA. 5Department of Neurobiology, The University of Chicago, 947 East 58th Street, Chicago, Illinois 60637, USA.
Figure 1 | An ANR-like signalling centre in S. kowalevskii. a–h, S.
kowalevskii and mouse in situ hybridizations for markers of ANR and
telencephalon. S. kowalevskii embryos are at the double-groove stage (36 h), and
are shown in dorsal view with the anterior (proboscis) to the top left of the image,
except where noted. Embryos are optically cleared except in o, q (insets) and
r–w. Mouse embryos are at approximately embryonic day 8.5. a, S. kowalevskii
sfrp1/5 expression. b, Frontal view of mouse Sfrp1 expression; arrowhead
denotes ANR. Inset shows lateral view. c, S. kowalevskii fgf8/17/18 expression.
d, Mouse Fgf8 expression; arrowhead denotes ANR. e, S. kowalevskii hh
expression. f, S. kowalevskii fgf-Sk1 expression. g, Frontal view of double FISH for
hh and fgf-Sk1. h, fz5/8 expression. i, Anteroposterior expression topologies in S.
kowalevskii and mouse embryos. Anterior to top. Asterisk indicates Shh is
expressed in the medial ganglionic eminence, near the ANR. j, k, foxg (j) and rx
(k) expression in embryos treated with SU5402. l, m, foxg (l) and rx




double-groove stages (28-36 h) eliminated the morphological boundary 
between the proboscis and collar, and resulted in altered gene expres- 
sion domains, indicating loss and/or posteriorization of anterior 
proboscis (Fig. 1j-m). Expression of foxg, the S. kowalevskii homologue 
of the telencephalon marker foxg1, was completely eliminated (Fig. 1j,
l), whereas rx, which is normally excluded from the anterior proboscis,
was expressed up to the anterior limit of the embryo (Fig. 1k, m). 

Wnt antagonists expressed in the ANR and/or telencephalon are 
also critical for forebrain development in vertebrates. To explore
potential similarities in Wnt functions between the telencephalon and 
proboscis, we first suppressed Wnt activity in the developing proboscis 
by using short interfering RNA (siRNA) microinjections to knock 
down the Wnt receptor frizzled 5/8, which is expressed with a sharp 
boundary at the proboscis base (Fig. 1h). Fz5/8 siRNA expanded apical 
identity at the expense of the posterior proboscis, as demonstrated by 
expansion of the apical marker fgf-Sk1 (Fig. 1n, p) and contraction of rx 
expression to a more posterior domain (Fig. 1o, q). Embryos that were 
injected with a scrambled control siRNA were indistinguishable from 
wild-type embryos (Fig. 1p, q), and the effects of fz5/8 knockdown were 
limited to its endogenous expression domain (Supplementary Fig. 1). 
Second, we suppressed Wnt signalling in small patches of proboscis 
ectoderm by injecting β-catenin siRNA into single animal blastomeres 
at early cleavage stages. β-catenin-deficient clones in the posterior 
proboscis ectoderm expressed the apical marker fgf-Sk1 (Fig. 1r-t), 
but not the more posterior proboscis marker rx (Fig. 1u-w), which 
indicates a transformation of posterior to apical fate in the absence of 
local Wnt signalling. 

Although not a genetic component of the ANR itself, sonic hedgehog 
(Shh) is expressed in the nearby medial ganglionic eminence’”® (see 
Fig. 2c, d) and interacts with FGFs to regulate telencephalon pattern- 
ing’. The apical S. kowalevskii proboscis ectoderm expresses hedgehog 
(hh; a homologue of Shh) (Fig. 1e) in a domain that partially overlaps 
fgf-Sk1 (Fig. 1f), with hh more broadly expressed dorsally and laterally 
(Fig. 1g). Embryos injected with hh siRNA lacked apical fgf-Sk1 
expression (Fig. 1x, y) indicating that hh at least partially regulates 
anterior FGF signalling and apical-ventral patterning. This finding 
(m) expression in embryos treated with DMSO. n, Expanded apical fgf-Sk1 
expression in an embryo injected with fz5/8 siRNA. o, Retracted rx expression in 
an embryo injected with fz5/8 siRNA. p, fgf-Sk1 expression in a siRNA control 
embryo. q, rx expression in a siRNA control embryo. Insets in o, q show frontal 
views of uncleared embryos. r, Wild-type fgf-Sk1 expression. s, fgf-Sk1 expression 
in descendants of a blastomere injected with β-catenin siRNA. t, Merged 
darkfield and fluorescence images showing clonal descendants of the injected 
cell (green). u, Wild-type rx expression. v, rx is not expressed in descendants of a 
blastomere injected with β-catenin siRNA. w, Merged darkfield and fluorescence 
images showing the location of the β-catenin-deficient clone (green). x, fgf-Sk1 
expression in an embryo injected with hh siRNA. y, fgf-Sk1 expression in a 
siRNA control embryo. Scale bars, 100 µm in S. kowalevskii, and 200 µm (b) and 
500 µm (d) in mice. col, collar; di, diencephalon; hb, hindbrain; mb, midbrain; 
pr, proboscis; tel, telencephalon; tr, trunk. 


raises the possibility that an anterior signalling domain including 
FGFs, Wnt antagonists and hh was present in stem deuterostomes 
and was spatially partitioned during vertebrate evolution. 


A ZLI-like regulatory program in hemichordates 


The ZLI is located in the vertebrate diencephalon and specifies the 
flanking prethalamus and thalamus’’”'*”*. The molecular signature of 
the ZLI is a narrow, transverse domain of Shh expression (Fig. 2c, d) that 
patterns the mid-diencephalon along its anteroposterior axis’”"**. A 
homologous hh expression domain is absent in invertebrate chordates, 
supporting the idea that the ZLI is a vertebrate genetic innovation, 
possibly associated with forebrain origins'°*'~*. Notably, we found that 
S. kowalevskii hh is expressed in a circumferential ectodermal band at 
the proboscis—collar boundary (Fig. 2a) in a transcriptional context of 
expression domains that is similar to the vertebrate forebrain’. 
Expression of the Hh receptor ptch in an overlapping but broader 
domain than hh itself, indicates that hh signals to adjacent ectoderm 
(Fig. 2b) similar to the ZLI (Fig. 2e, f). The proboscis-collar boundary 
region also expresses fng, otx, wnt8 (Fig. 2g—m, v) and six3 (ref. 7) in 
spatial arrangements that are characteristic of the mid-diencephalon 
and ZLI. In vertebrates, Lfng is expressed broadly in the diencephalon 
except for the ZLI (ref. 38), whereas Wnt8b and Otx genes mark the ZLI 
itself in a combinatorial pattern that is unique to this region'® (Fig. 2i, j, 
m). S. kowalevskii hh is also expressed in a fng-negative, otx- and wnt8- 
positive territory posterior to six3 (ref. 7) (Fig. 2g, h, k, l, v). Notably, the 
hemichordate homologues of irx and fezf, which abut at the ZLI and are 
expressed in anterior abutting domains in Drosophila and amphioxus 
(ref. 10), show divergent expression patterns in S. kowalevskii 
(Supplementary Fig. 2), indicating secondary modification in hemi- 
chordates. In vertebrates, Otx genes are expressed in restricted 
domains in the mid-diencephalon at the stage when the ZLI forms 
(Fig. 2i, j), and otx1l and otx2 knockdown eliminates shh expression 
at the zebrafish ZLI’. Similarly, in S. kowalevskii, hh and otx are co- 
expressed in the presumptive proboscis-collar boundary ectoderm 
shortly after gastrulation (Fig. 2k), and otx siRNA knockdown reduced 
hh expression at the proboscis-collar boundary (Fig. 2w, x). 
Figure 2 | A ZLI-like signalling centre in S. kowalevskii. a-u, In situ 
hybridizations for S. kowalevskii and mouse homologues of ZLI and 
diencephalon markers. Arrowheads mark the proboscis-collar boundary in S. 
kowalevskii. Mouse images show hemisected heads at embryonic day 10.5, and 
dashed lines indicate the ZLI, with arrows denoting its extent. a, S. kowalevskii 
hh expression. b, S. kowalevskii ptch expression (see Supplementary Fig. 3). 

c, Mouse Shh expression. d, Diagram of Shh expression showing ZLI in dark 
blue. e, Mouse Ptch1 expression. f, Mouse Ptch2 expression. g, S. kowalevskii fng 
expression. h, S. kowalevskii otx expression. i, Mouse Otx1 expression. j, Mouse 
Otx2 expression. k, hh (magenta) and otx (green) are co-expressed at the 
presumptive proboscis-collar boundary. l, S. kowalevskii wnt8 expression. 
m, Mouse Wnt8b expression. n, S. kowalevskii pax6 expression. o, Double in 
situ hybridization showing pax6 expression (fluorescence, green) anterior to hh 


Hemichordate homologues of diencephalic markers regulated by 
Shh at the vertebrate ZLI are also expressed in similar topological 
arrangements at the S. kowalevskii proboscis-collar boundary. In 
vertebrates, Pax6 and Dlx2 are expressed anterior to the ZLI in the 
prethalamus (Fig. 2p, s), where their expression requires Shh’”"*. 
Similarly, S. kowalevskii pax6 and dlx are expressed anterior to hh 
at the proboscis base (Fig. 2n, o, q, r, v). In vertebrates, Foxa2 is 
expressed in the ZLI itself'’, and in S. kowalevskii, foxa is expressed 
at the proboscis—collar boundary (Fig. 2t, u). Targeting hh function in 
S. kowalevskii by injecting hh siRNA downregulated dlx at the 
proboscis-collar boundary (Fig. 2y, z). However, these embryos had 
other strong defects in anteroposterior and dorsoventral patterning, 
probably owing to hh having additional roles that complicated the 
assessment of hh function at the proboscis-collar boundary (Sup- 
plementary Fig. 3). To reduce early pleiotropic effects, we treated 
embryos with the Hh signalling inhibitor cyclopamine* from the end 
of gastrulation through double-groove stage. Treating embryos with 50 
µM cyclopamine downregulated dlx at the proboscis-collar boundary 
with limited effects on midline expression and general morphology 
(Fig. 2a’, b’), suggesting that Hh signalling from the proboscis—collar 
boundary regulates dlx. The prominent similarities in expression of ZLI 
marker homologues in S. kowalevskii and vertebrates, and the con- 
served functions for otx and hh, suggest that an ancestral signalling 
centre homologous to the ZLI was present in early deuterostomes. 


An IsO-like regulatory program in hemichordates 

The IsO is located at the midbrain-hindbrain boundary (MHB) and is 
defined molecularly by abutting domains of FGF8 and WNT1, which 
induce and pattern adjacent neural structures’*"*'***. The search for a 
pre-vertebrate IsO has focused on expression patterns of these ligands 
along with orthologues of the transcription factors otx, gbx, en and 
pax2/5/8, whose combinatorial expression patterns in vertebrates 
define a molecular territory unique to the MHB (Fig. 3b, d, g, i, m, 
p-r). In amphioxus, CNS expression of fgf8/17/18 is restricted to the 
(colorimetric, black). p, Mouse Pax6 expression. q, S. kowalevskii dlx 
expression. r, Double FISH showing dlx expression (green) in the proboscis 
base anterior to hh (magenta). s, Mouse Dlx2 expression. t, S. kowalevskii foxa 
expression. u, Double FISH showing foxa (green) and hh (magenta) expression. 
v, Diagram of anteroposterior expression topologies of ZLI and forebrain 
marker homologues in S. kowalevskii and mice. Six3 expression based on 
previous data’. Anterior to top. w, x, otx siRNA downregulates hh expression at 
the proboscis-collar boundary (w) relative to a control siRNA (x). y, z, hh 
siRNA reduces dlx expression in the proboscis base (y) relative to a scrambled 
siRNA (z). a’, b’, cyclopamine treatment reduces dlx expression in the 
proboscis base (a’) relative to a control embryo treated with DMSO (b’). Insets 
show ventral views. Scale bars, 100 µm in S. kowalevskii embryos, 1 mm in mice. 
MGE, medial ganglionic eminence; pth, prethalamus; th, thalamus. 

anterior cerebral vesicle’, and wnt1 is not expressed in the CNS”. In 
Ciona intestinalis, fgf8/17/18 is expressed in the larval visceral ganglion 
where it regulates en and pax2/5/8 to delineate the sensory vesicle and 
neck regions”, suggesting that at least a partial IsO-like signalling 
centre pre-dates vertebrates. However, wnt1 is absent in the 
C. intestinalis genome, making it difficult to infer the full extent of 
this ancestral centre. In Drosophila melanogaster, otx and gbx orthologues 
are expressed in patterns similar to those at the MHB, but the absence of 
compelling similarities in the expression of fgf8/17/18-related genes, 
and repeated expression of wnt1 and en at parasegmental boundaries” 
weakens the idea of homology with the vertebrate IsO. To assess the 
presence of an IsO-like region in hemichordates, we investigated 
expression patterns for S. kowalevskii homologues of vertebrate MHB 
markers (Fig. 3a-t). We found that at double-groove stage (36 h), fgf8/17/18 
and wnt1 are expressed in adjacent ectodermal bands in the 
anterior trunk with wnt1 expressed anterior to fgf8/17/18 (Fig. 3a, c, 
e), a topology similar to the vertebrate IsO (Fig. 3b, d). In vertebrates, 
abutting domains of otx and gbx genes position the IsO (ref. 3), and 
opposing otx and gbx domains are also found in protostomes’? and 
amphioxus*”, indicating that this pattern is ancestral to bilaterians. A 
reassessment of otx and gbx expression patterns in S. kowalevskii’ 
revealed that they are also expressed in adjacent domains at the 
collar-trunk coelom boundary, with gbx expressed in the ectoderm 
between the most posterior otx domains (Fig. 3f, h, j). However, the 
spatial arrangements of otx and wnt1, and gbx and fgf8/17/18, are 
reversed in S. kowalevskii and vertebrates (Fig. 3t) indicating that diver- 
gent mechanisms position homologous IsO-like regions in these groups. 
In vertebrates, pax2, pax5 and pax8, and en1 and en2 are co-expressed at 
the MHB (Fig. 3m, p-r). However, expression of pax2/5/8 and en does 
not overlap in S. kowalevskii (Fig. 3n, o) or invertebrate chordates””®, 
which suggests that regulation of en genes by pax2/5/8 genes is a novel 
feature of vertebrate development. Beyond similarities in expression of 
signalling molecules and transcription factors, we found that the cate- 
cholaminergic neuron marker tyrosine hydroxylase is co-expressed 
Figure 3 | An IsO-like signalling centre in S. kowalevskii. a-s, In situ 
hybridizations for S. kowalevskii and mouse homologues of MHB markers. 
Arrowheads mark the S. kowalevskii collar-trunk coelom boundary and the 
mouse IsO. a, S. kowalevskii fgf8/17/18 expression. b, Mouse Fgf8 expression. 
c, S. kowalevskii wnt1 expression. d, Mouse Wnt1 expression. e, Double FISH 
showing S. kowalevskii wnt1 (green) expressed directly anterior to fgf8/17/18 
(magenta). f, S. kowalevskii otx expression. g, Mouse Otx2 expression. h, S. 
kowalevskii gbx expression. Asterisks denote endodermal domains. i, Mouse 
Gbx2 expression. j, Double FISH for S. kowalevskii otx (green) and gbx 
(magenta). k, Double FISH for S. kowalevskii wnt1 (green) and en (magenta). 
l, Double FISH for S. kowalevskii en and otx. m, Mouse En2 expression. n, S. 
kowalevskii pax2/5/8 expression. o, Double FISH showing S. kowalevskii pax2/5/8 
and en expression. p-r, Expression of mouse Pax2 (p), Pax5 (q) and Pax8 


with en in the S. kowalevskii posterior collar (Fig. 3s) similar to verte- 
brates“, raising the possibility that this neuronal population is homo- 
logous to vertebrate midbrain dopaminergic neurons. 

Functional assays further support the deep deuterostome ancestry 
of a MHB-like genetic module. In vertebrates, fgf8 mediates the 
organizing abilities of the IsO and maintains expression of other 
MHB markers’*'**. To assess similar requirements for FGFs in hemichordates, 
we first treated embryos with 100 µM SU5402 at the end of 
gastrulation to suppress FGF signalling without perturbing earlier 
patterning events. Expression of en was strongly reduced in 
SU5402-treated embryos (Fig. 3u, v), whereas pax2/5/8 was unaffected 
(Fig. 3y, z), compared to control embryos treated with dimethyl- 
sulphoxide (DMSO). To test specifically for a role of fgf8/17/18 in 
regulating collar-trunk patterning, we injected fertilized oocytes with 
fgf8/17/18 siRNA. Knockdown of fgf8/17/18 reduced en expression 
(Fig. 3w, x) but had no effect on pax2/5/8 expression (Fig. 3a’, b’) 
relative to embryos injected with a scrambled control siRNA. The 
absence of any effect on pax2/5/8 expression in these experiments 
highlights differences in gene regulation downstream of fgf8/17/18 
homologues between hemichordates and chordates*”°. 

In vertebrates, Wnt1 is required to maintain expression of en genes 
at the MHB*”. To assess local Wnt functions at the S. kowalevskii 
collar-trunk boundary, we injected single blastomeres with β-catenin 
siRNA at early cleavage stages to suppress Wnt signalling in small 
patches of collar-trunk ectoderm. Clonal descendants of injected 
blastomeres failed to express en (Fig. 3c'-e'), suggesting a similar role 
for Wnts in regulating en genes at the MHB and collar-trunk boundary. 


Discussion 


Extensive similarities in expression patterns of signalling-centre markers 
and conserved functions for FGF, Wnt and Hh signals in S. kowalevskii 
and vertebrates provide compelling evidence that signalling centres 
(r). s, Double FISH for S. kowalevskii tyrosine hydroxylase (green) and en 
(magenta). t, Summary of anteroposterior expression topologies in 
hemichordates and mice. Anterior to top. u, en expression is reduced in an 
embryo that has been treated with SU5402. v, en expression in a DMSO-treated 
control embryo. w, en expression in an embryo injected with fgf8/17/18 siRNA. 
x, en expression in an embryo injected with a control siRNA. y, pax2/5/8 
expression in an embryo treated with SU5402. z, pax2/5/8 expression in a 
DMSO-treated control embryo. a', pax2/5/8 expression in an embryo injected 
with fgf8/17/18 siRNA. b', pax2/5/8 expression in an embryo injected with a 
control siRNA. c', en is not expressed in descendants of a blastomere injected 
with β-catenin siRNA. d', Merged darkfield and fluorescence images showing 
the location of the β-catenin-deficient clone (green). e', Wild-type en 
expression. Scale bars, 100 µm in S. kowalevskii embryos; 1 mm in mice. 


homologous to the ANR, ZLI and IsO were parts of an ancient genetic 
regulatory scaffold that pre-dates the morphological innovations of 
vertebrates (Fig. 4). Therefore, assembly of these genetic networks 
did not trigger morphological novelties of the brain. Instead, early 
vertebrate brain evolution involved modifying and elaborating ances- 
tral signalling centres to pattern novel structures within a highly con- 
served gene regulatory framework for anteroposterior ectodermal 
patterning’. 

Widespread losses of signalling-centre components in invertebrate 
chordates and the unresolved nature of the ancestral deuterostome 
nervous system present a challenge for inferring when the ANR, ZLI 
and IsO genetic circuits were first deployed to regulate CNS patterning 
specifically, and to what extent these integrations could have been 
associated with origins of vertebrate novelties. In S. kowalevskii, 
signalling-centre programs are deployed at stages when the nervous 
system is still circumferentially organized’ and in body regions that 
have been described as containing components of the adult peripheral, 
rather than central, nervous system in the hemichordate Ptychodera 
flava*’. Notably, similar to their roles in the vertebrate brain, the ZLI 
and IsO-like signalling centres are located at morphological boundaries 
in S. kowalevskii, suggesting ancient roles in demarcating ectodermal 
divisions. We propose that rather than having evolved for CNS pattern- 
ing, the most ancient role for signalling centres was in general body plan 
regionalization. 

This work also highlights that basal chordates have not retained 
all ancestral chordate characters and have undergone substantial 
independent evolutionary changes. This is generally accepted for 
urochordates, but based on available data, cephalochordates are often 
considered to be the most informative extant group for reconstructing 
ancestral chordate characters*’**. Our data provide clear evidence 
that secondary losses of complex developmental mechanisms have 
occurred in cephalochordates, and that overlooking this possibility 
Figure 4 | Evolutionary gain and loss of ANR, ZLI and IsO-like genetic 
programs. Schematic diagrams depicting the expression of Fgf8, Sfrp1, Shh and 
Wnt1 homologues in the mouse brain and ectoderm of C. intestinalis, 
amphioxus and S. kowalevskii. Embryos are oriented with their anterior side to 
the left of the image and their dorsal side to top. Bar diagrams are oriented with 
the anterior side to the left of the image. Diagrams depict only expression 
domains that are related to signalling components of vertebrate CNS signalling 
centres. cv, cerebral vesicle; n, neck; nc, nerve cord; sv, sensory vesicle; vg, 
visceral ganglion. Diagrams are not to scale. Single asterisk indicates that Shh is 
expressed in the medial ganglionic eminence, near the ANR. Double asterisk 
indicates that sfrp1/5 is expressed in the C. intestinalis anterior ectoderm from 
the 64-cell stage up to neurulation but is then downregulated in the anterior 
ectoderm and CNS (shown as yellow stripes). 


can inflate the numbers of putative vertebrate novelties. Although 
divergence of ANR, ZLI and IsO-like genetic programs in extant 
invertebrate chordate lineages may have been associated with the loss 
or modification of anterior ectodermal structures of the CNS and 
head, this is difficult to test. On the basis of these new observations, 
re-analysis of early chordate fossils that accommodates scenarios of 
greater anterior complexity in stem chordates may be informative. 
However, in the absence of additional fossil data, predictions of the 
morphological consequences of developmental genetic losses based 
on molecular data alone are unreliable. 

This study provides a compelling example of the challenges asso- 
ciated with identifying key developmental genetic innovations 
responsible for morphological innovations at macroevolutionary 
scales. The unexpected presence of ANR, ZLI and IsO-like programs 
in S. kowalevskii highlights that basing outgroup choice solely on 
morphological criteria can lead to erroneous conclusions about links 
between morphological and developmental genetic characters: 
although by almost all morphological criteria amphioxus shares more 
similarities with vertebrates than do hemichordates, our data support 
the hypothesis that in certain cases hemichordates will be a more 
informative group than basal chordates for reconstructing stem 
chordate characters and understanding the origins of vertebrate 
developmental genetic processes. Additional data from protostomes, 
especially lophotrochozoans, will be required to assess whether the 
ANR, ZLI and IsO genetic programs are unique deuterostome fea- 
tures or have even deeper bilaterian origins. 

Our findings highlight the importance of broad phylogenetic 
sampling, including morphologically divergent outgroups, to identify 
gene regulatory innovations responsible for evolutionary changes in 
body plans. With growing use of novel model organisms, it seems that 
secondary losses of complex developmental regulatory characters 
may occur commonly***’, with consequences for morphological 
evolution that are still poorly understood. Conversely, the presence 
of complex developmental modules that regulate morphologically dis- 
parate structures in distantly related lineages suggests a loose coupling 
between morphological and gene regulatory evolution over macro- 
evolutionary timescales, and highlights the difficulties of reconstruct- 
ing ancestral morphological characters from molecular genetic data. 


METHODS SUMMARY 


Gravid S. kowalevskii were collected at Waquoit Bay National Estuarine Research 
Reserve near Woods Hole, Massachusetts, and maintained at the Marine 
Biological Laboratory, Woods Hole. Spawning and embryo rearing were carried 
out using established techniques’. For SU5402 and cyclopamine treatments, 
embryos were raised either in inhibitor, or in an equivalent concentration of 
DMSO, diluted in 0.2 µm-filtered sea water. siRNA microinjections were performed 
as described previously*’”’ using calcein or lysinated 10,000 molecular 
weight (MW) tetramethylrhodamine dextran tracers. Descendants of injected 
blastomeres were detected by anti-tetramethylrhodamine immunofluorescence. 
siRNA sequences are provided in Supplementary Table 1. Procedures involving 
mice (Mus musculus) were performed under a protocol approved by the University 
of Chicago Institutional Animal Care and Use Committee. Embryo fixations and 
colorimetric in situ hybridizations were performed using standard protocols”'. 
See Methods for fluorescent in situ hybridization (FISH) protocol. Photographs 
were taken using Zeiss Axiocam MRc5 or MRm cameras on a Zeiss AxioImager.Z1 
or a Discovery.V12. Images were acquired using Zeiss Axiovision 4.8 software and 
adjusted for colour balance and/or levels or gamma using Axiovision 4.8 or Adobe 
Photoshop CS3 or CS5 software. Images of experimental and control embryos 
were processed using the same parameters. S. kowalevskii homologues of verte- 
brate genes were identified in an expressed sequence tag (EST) library screen”. 
Amino acid sequences were aligned using ClustalW“, and putative homologues 
were confirmed by constructing gene trees using MrBayes 3.1.2 (refs 49,50) with 
parameters optimized for each gene (Supplementary Figs 4 and 5). See 
Supplementary Table 2 for accession numbers. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Embryo procurement. Gravid S. kowalevskii adults were collected from the 
intertidal zone of sandy estuaries near Woods Hole, Massachusetts, and main- 
tained in flow-through seawater tables at the Woods Hole Marine Biological 
Laboratory. Spawning and embryo rearing were carried out using established 
techniques’. Mouse embryos were obtained using standard protocols approved 
by the University of Chicago Institutional Animal Care and Use Committee. 
Experimental manipulations. SU5402 (Calbiochem) and cyclopamine 
(Calbiochem) were resuspended in minimal volumes of DMSO and diluted in 
0.2 µm-filtered sea water for treatments. Embryos were raised at room temperature. 
Control embryos were obtained from the same batches and raised in an 
equivalent concentration of DMSO. siRNA microinjections were performed as 
described previously*””’. siRNAs were resuspended at 100 mM and injected at 
1/10 concentration in a suspension buffer’’ including either calcein or lysinated 
10,000 MW tetramethylrhodamine dextran (Molecular Probes). Unsuccessfully 
injected embryos, or embryos with abnormal cleavage patterns, were discarded. 
siRNAs were ordered from Ambion or Qiagen, and sequences are provided in 
Supplementary Table 1. 

In situ hybridizations. Colorimetric in situ hybridizations were performed as 
described previously’*'. Clonal descendants of injected blastomeres were iden- 
tified after in situ hybridization using anti-tetramethylrhodamine (Molecular 
Probes) immunofluorescence. For FISH, pre-hybridization steps were performed 
using our standard protocol. For double FISH, embryos were simultaneously 
hybridized with antisense RNA probes for both genes labelled separately with 
digoxigenin-11-UTP (Roche) or fluorescein-12-UTP (Roche). Owing to a 
reduced sensitivity of FISH, we typically used two to three times more probe 
compared to colorimetric methods. We used our standard protocol for post- 
hybridization washes and blocking steps. Embryos were then incubated in anti- 
digoxigenin-POD or anti-fluorescein-POD antibodies (Roche) diluted 1:200 in 
blocking solution (2% Roche blocking reagent, 1X MAB) for 4 h at room temperature 
on a rotary shaker. Embryos were washed four times in 1X MAB for 30 
min and once for 1 h at room temperature, or overnight at 4 °C. Embryos were 
then washed in 0.1 M imidazole (Sigma) in 1X PBS for 10 min at room temper- 
ature and probes were detected using a Tyramide Signal Amplification (TSA) Plus 
kit (Perkin-Elmer). Embryos were rinsed in 1X amplification diluent and incubated 
in cyanine-3 or cyanine-5 tyramide diluted 1:50 in 1X amplification diluent 
for 45 min on a rotary shaker at room temperature. Embryos were washed three 
times in detergent solution (1% triton X-100, 1% SDS, 0.5% sodium deoxycholate, 
50 mM Tris-HCl pH 8.0, 1 mM EDTA pH 8.0, 150 mM sodium chloride) for 20 
min at 60 °C. When detecting a second gene, we washed the embryos once for 20 
min in solution X (50% formamide, 2X SSC and 1% SDS) at 60 °C. Embryos were 
then washed three times in 1X MAB for 5 min and returned to the blocking step. 
For genes with low signal under standard FISH methods, we performed an 
additional 2,4-dinitrophenol (DNP) amplification step. After primary antibody 
incubation and 1X MAB washes, embryos were incubated in DNP amplification 
reagent that was diluted 1:50 in 1X amplification diluent (Perkin-Elmer) for 5-10 
min at room temperature on a rotary shaker. Embryos were washed four times in 
1X MAB for 30 min and once for 1 h at room temperature, re-blocked for 1 h and 
incubated in anti-DNP-HRP antibody (Perkin-Elmer) that was diluted 1:200 in 
blocking solution for 4 h at room temperature on a rotary shaker. Embryos were 
washed four times in 1X MAB for 30 min and once for 1 h at room temperature. 
Expression was then detected using a TSA Plus kit (Perkin-Elmer) as described 
above. 
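
The dilution ratios quoted in this protocol (antibodies at 1:200, tyramide and DNP amplification reagent at 1:50) translate into pipetting volumes by simple arithmetic. The sketch below is illustrative only. It assumes that "1:N" means one part stock in N parts total volume, which the protocol does not state, and the working volumes are hypothetical examples rather than values from the Methods.

```python
def stock_volume_ul(final_volume_ul, dilution_factor):
    """Volume of stock (in µl) needed for a 1:dilution_factor dilution.

    Assumption (not stated in the protocol): '1:N' means one part stock
    in N parts total volume.
    """
    if final_volume_ul <= 0 or dilution_factor < 1:
        raise ValueError("final volume must be positive and dilution factor >= 1")
    return final_volume_ul / dilution_factor


def diluent_volume_ul(final_volume_ul, dilution_factor):
    """Volume of diluent (in µl) that brings the stock up to the final volume."""
    return final_volume_ul - stock_volume_ul(final_volume_ul, dilution_factor)


# Hypothetical working volumes, chosen only for illustration.
antibody = stock_volume_ul(500, 200)  # antibody at 1:200 in blocking solution
tyramide = stock_volume_ul(100, 50)   # tyramide at 1:50 in amplification diluent
```

If a supplier defines "1:N" as one part stock plus N parts diluent, the divisor becomes N + 1 instead of N.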
Photography. Embryos were photographed using Zeiss Axiocam MRc5 or MRm 
cameras on Zeiss AxioImager.Z1 or Discovery.V12 microscopes. For optical 
clearing, S. kowalevskii embryos were dehydrated into methanol, cleared using 
Murray’s clearing reagent (1:2 benzyl alcohol to benzyl benzoate) and mounted in 
permount (Fisher Scientific). Cleared embryos were imaged under differential 
interference contrast optics and uncleared embryos were photographed in 1X 
PBS or 1X MAB on agarose-coated dishes. Images were acquired using Zeiss 
Axiovision 4.8 software and adjusted for colour balance and/or levels or gamma 
using Zeiss Axiovision 4.8 or Adobe Photoshop CS3 or CS5 software. Images of 
experimental and control embryos were processed using the same parameters. 
Gene identification. S. kowalevskii homologues of vertebrate genes were iden- 
tified in an EST library screen’, amino acid sequences were aligned using 
ClustalW* and putative homologues were confirmed by constructing gene trees 
using MrBayes 3.1.2 (refs 49,50) with parameters optimized for each gene 
(Supplementary Figs 4 and 5). See Supplementary Table 2 for accession numbers. 
Genome-wide structure and organization 
of eukaryotic pre-initiation complexes 


Ho Sung Rhee¹ & B. Franklin Pugh¹ 


Transcription and regulation of genes originate from transcription pre-initiation complexes (PICs). Their structural and 
positional organization across eukaryotic genomes is unknown. Here we applied lambda exonuclease to chromatin 
immunoprecipitates (termed ChIP-exo) to examine the precise location of 6,045 PICs in Saccharomyces. PICs, 
including RNA polymerase II and protein complexes TFIIA, TFIIB, TFIID (or TBP), TFIIE, TFIIF, TFIIH and TFIIK were 
positioned within promoters and excluded from coding regions. Exonuclease patterns were in agreement with 
crystallographic models of the PIC, and were sufficiently precise to identify TATA-like elements at so-called 
TATA-less promoters. These PICs and their transcription start sites were positionally constrained at TFIID-engaged 
downstream +1 nucleosomes. At TATA-box-containing promoters, which are depleted of TFIID, a +1 nucleosome was 
positioned to be in competition with the PIC, which may allow greater latitude in start-site selection. Our genomic 
localization of messenger RNA and non-coding RNA PICs reveals that two PICs, in inverted orientation, may occupy the 
flanking borders of nucleosome-free regions. Their unambiguous detection may help distinguish bona fide genes from 
transcriptional noise. 


Assembly of the PIC and its post-assembly control are critical early 
steps in the transcription of eukaryotic genes. TBP (TATA-binding 
protein) arrives at most promoters as part of the multi-subunit TFIID 
complex that includes TBP-associated factors (TAFs)’. Together these 
proteins help recruit RNA polymerase (Pol) II and its entourage of 
general transcription factors (GTFs) to the transcription start sites 
(TSSs) of genes” *. These PICs assemble in nucleosome-free promoter 
regions (NFRs) that are flanked by an upstream -1 nucleosome and a 
downstream +1 nucleosome’. PICs have largely been defined bio- 
chemically using purified GTFs at a few model genes*~, but little is 
known about their assembly and organization in vivo, particularly at 
near-base-pair resolution on a genome-wide scale. 

An oddity of TBP is that when it is part of the TFIID complex, it 
tends to bind promoters that lack the TATA box consensus 
TATAWAWR (W indicates A/T; R indicates A/G)*. Approximately 
80-90% of all Saccharomyces genes are thus designated as 'TATA-less', 
and have a predominant PIC assembly mechanism and chromatin 
architecture that differs substantially from those in the 'TATA 
box' class of genes”. So far, no TBP-binding motif has been identified 
at TATA-less promoters, and so the origins of TFIID-promoter 
specificity have been rather enigmatic'®. When TBP is not part of 
the TFIID complex, the SAGA complex directs TBP to TATA-box- 
containing Pol II promoters'!’. 
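
As an illustration of what the consensus TATAWAWR means in practice, the following minimal sketch expands the two ambiguity codes given above (W = A/T, R = A/G) into a regular expression and scans a sequence for exact matches. The function names and the example sequences are hypothetical. The scan covers only the strand supplied and does not tolerate mismatches, so it is not the procedure used in the paper to define TATA-like elements.

```python
import re

# Ambiguity codes as defined in the text: W = A or T, R = A or G.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]", "R": "[AG]"}


def consensus_to_pattern(consensus):
    """Translate a consensus string such as 'TATAWAWR' into a regex pattern."""
    return "".join(IUPAC[base] for base in consensus)


# A lookahead is used so that overlapping matches are all reported.
TATA_BOX = re.compile("(?=" + consensus_to_pattern("TATAWAWR") + ")")


def find_tata_boxes(sequence):
    """Return 0-based start positions of exact consensus matches
    on the strand that was supplied."""
    return [match.start() for match in TATA_BOX.finditer(sequence.upper())]


# Hypothetical example sequences.
print(find_tata_boxes("GGCTATAAAAGGC"))  # one match starting at index 3
print(find_tata_boxes("TATACAAA"))       # no match: C is not allowed at W
```

To scan the opposite strand as well, the reverse complement of the sequence would need to be searched separately.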

TFIIA and TFIIB clamp TBP to DNA, and make DNA contacts 
immediately upstream and downstream of the TATA box. TFIIB is a 
linchpin between TBP and Pol II'*"». Its intimate contact with Pol II 
directs how far downstream Pol II productively initiates transcrip- 
tion'®'’. TFIIF enhances the interaction of Pol II with TFIIB, assists 
in recruiting TFIIE, and promotes downstream elongation events*"*. 
TFIIE then stimulates DNA strand separation by Pol II at the TSS, and 
enhances the activity of TFIIH. TFIIH holoenzyme is multi-functional, 
having ATP-dependent helicase (Ssl2 and Rad3) and kinase (Kin28) 
activities that reside on biochemically separable sub-complexes (TFIIH 
and TFIIK, respectively), both of which are key to efficient open com- 
plex formation and transcription initiation’. 


We examined the structural organization of PICs and their specificity 
on a genomic scale using ChIP-exo”’. This novel strategy substantially 
improved mapping resolution and eliminated many false positives. The 
exonuclease processively degrades a DNA strand in the 5′–3′ direction 
until a crosslinking point is encountered (Supplementary Fig. 1a). The 
crosslinking inefficiency inherent to ChIP allows multiple crosslinking 
points to be detected in a population by deep sequencing. When applied 
to the GTFs on a genomic scale, we obtained detailed and comprehen- 
sive information on PIC structure and genomic organization. 


Genome-wide PIC structure 
We applied ChIP-exo genome-wide to Pol II and each GTF (Fig. 1a), 
and verified binding for TFIIB by locus-specific PCR using a series of 
tiled primers (Supplementary Fig. 1b). When exonuclease stop sites 
were mapped over all annotated mRNA promoters that contained a 
TATA box consensus, a distinctive pattern was observed (Fig. 1b). 
Importantly, each GTF displayed a strand-specific composite pattern 
of exonuclease stop sites that occurred only when TATA boxes, but 
not TSSs (Supplementary Fig. 2a and data not shown), were aligned, 
indicating that PICs are positioned with respect to the TATA box. 

For TFIIB, we detected four DNA crosslinking points (pairs of 
exonuclease stops), designated B1-B4 (Fig. 1c). The TATA box was 
precisely centred between B1 and B2, which were separated by 20 ± 3 
base pairs (bp). Crosslinking point B3 and the diffuse B4 region indi- 
cate that TFIIB further crosslinked over a broad region downstream of 
the TATA box towards the TSS. We compared the four TFIIB cross- 
linking points to crystallographic-based models of open and closed 
TBP-TFIIB-Pol-II-promoter complexes (Fig. 1c)'*’°. An important 
caveat of the crystallographic models is that they were built from multiple 
independent structures of truncated TFIIB-TBP-TATA, TFIIB-Pol 
II and Pol II elongation complexes. Thus, the combined structures 
represent a hypothetical organization. 

Within the modelled closed and open structures, crosslinking site 
B1 precisely (± 3 bp) mapped to where the TFIIB carboxy-terminal 
core straddles the upstream DNA-binding stirrup of TBP (11 bp 
Figure 1 | Genome-wide structural organization of PICs. a, Raw ChIP-exo 
tag distribution for GTFs and Pol II around the RPS14B gene. Filled plots 
represent unfiltered 5’ ends of sequencing tags on the sense (darker fill) and 
antisense strand (lighter fill). b, Average GTF and Pol II occupancy around the 
TATA box of 676 annotated mRNA genes. Plots are as in panel a. 


upstream of the TATA box midpoint). B2 mapped precisely (± 3 bp) 
to where the TFIIB core amino-terminal cyclin fold encounters DNA 
just downstream of TBP’s other stirrup (9 bp downstream of the TATA 
box midpoint). B3 mapped to where the TFIIB linker region is closest to 
the DNA, which was 19 bp downstream of the TATA box midpoint. B4 
corresponded to a broad region defined by close proximity of the TFIIB 
reader (or finger) domain to single-stranded DNA within the modelled 
open complex, but was not evident within the closed complex. Similar 
broad regions of crosslinking were observed with the other GTFs 
(Fig. 1b), and may reflect indirect crosslinking. Support that these 
PICs represent open complexes is provided by permanganate reactivity 
studies of the GAL1, GAL10 and HSP82 loci****. Taken together, the 
entirety of the genomic crosslinking sites observed with the GTFs and 
Pol II fits remarkably well with the crystallographic models of the PIC 
open complex'*”*, and with many aspects of in vitro chemical cross- 
linking of these proteins'’??*”’. 


TATA-like elements at TATA-less genes 

An apparent paradox of so-called TATA-less promoters is their 
utilization of TFIID, which has long been described as the TATA- 
box-binding complex*. However, it is unclear whether the TBP subunit 
of TFIID recognizes specific DNA sequences at TATA-less promoters. 
Inasmuch as TBP is expected to be found at all TATA-less promoters, 
and motif searching algorithms failed to identify candidate TBP-binding 
sites, we instead opted to search for sequence elements with up to two 
mismatches to the TATAWAWR consensus. We also limited our search 
to measured PIC locations. Remarkably, 99% of the PICs at TATA-less 
promoters contained a sequence having two or fewer mismatches to the 
TATA box consensus (Fig. 2). We refer to these mismatched elements as 
models of the PIC’. The top panel is expanded from panel b for TFIIB. The 
middle and bottom panels show modelled open and closed TBP-TFIIB-Pol- 
Figure 2 | Identification of TATA-like elements at TATA-less genes. Left, 
TFIIB occupancy around individual TFIIB-enriched TATA elements of mRNA 
genes (rows, n = 3,945), sorted by occupancy level. Occupancy on the sense 
(blue) and antisense (red) strands is shown with respect to TSS orientation. 
Right, a colour chart representation of the DNA sequence located ± 20 bp from 
the TATA element midpoint and ordered as shown in the left panel. 
‘TATA-like’, as they did not form a consensus, whereas those conforming 
to the consensus retain the ‘TATA box’ designation. We refer to the 
two elements together as ‘TATA elements’. 
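
The mismatch search described above can be illustrated with a minimal sketch. It assumes the standard IUPAC meanings given in the text (W = A/T, R = A/G); the function names and the example sequence are hypothetical and are not taken from the authors' pipeline.

```python
# Bases allowed at each consensus position (W = A/T, R = A/G).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "AT", "R": "AG"}
CONSENSUS = "TATAWAWR"


def mismatches(site, consensus=CONSENSUS):
    """Count positions of `site` that the consensus does not allow."""
    return sum(base not in IUPAC[code] for base, code in zip(site, consensus))


def find_tata_elements(seq, max_mismatches=2):
    """Return (offset, site, n_mismatches) for every 8-mer within the limit."""
    k = len(CONSENSUS)
    hits = []
    for i in range(len(seq) - k + 1):
        site = seq[i:i + k].upper()
        n = mismatches(site)
        if n <= max_mismatches:
            hits.append((i, site, n))
    return hits


# Illustrative sequence only: one perfect TATA box starting at offset 2.
print(find_tata_elements("CCTATAAAAGCC", max_mismatches=0))
```

Elements with zero mismatches correspond to the 'TATA box' class, and those with one or two mismatches to the 'TATA-like' class.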

To assess whether TFIIB was positioned around these TATA-like 
elements in a canonical manner as seen at bona fide TATA boxes, 
strand-specific ChIP-exo tags were plotted around each element, sepa- 
rated into panels by 0, 1 or 2 mismatches to the TATA box consensus 
(Fig. 2). Notably, regardless of its occupancy level, the distribution of 
TFIIB crosslinking and thus its positioning relative to these TATA-like 
elements was quite similar to the positioning observed at bona fide 
TATA boxes. When the other GTFs were examined, their patterns 
relative to TATA-like elements were also similar to those found at 
TATA boxes (Supplementary Fig. 2b), although some downstream 
differences were observed (addressed later). Thus, as previously seen 
at the three yeast TATA-box-containing genes GAL1, GAL10 and 
HSP82 (refs 23, 24), and in mammalian in vitro systems”®, at least the 
upstream portion of most PICs is positioned with respect to resident 
TATA elements nearly identically, regardless of their Pol II promoter 
classification as TATA-box containing or TATA-less. Although the 
‘TATA-less’ designation may be a misnomer, this class of genes is not 
simply a slight variation of the TATA class, but instead has predom- 
inant regulation by TFIID versus SAGA, positive versus negative regu- 
lation by chromatin, and lower plasticity of expression®”. 


TATA-less TSS positioning by nucleosomes 

Permanganate reactivity experiments detect open complex formation 
upstream of the TSS at the GAL1, GAL10 and HSP82 TATA-box- 
containing promoters****. These and other studies'””’ have led to the 
notion that Pol II scans downstream from the open complex to find 
the TSS. In agreement with this being a general mechanism, we find 
that PICs of TATA-box-containing genes generally reside upstream 
of the TSS (Supplementary Fig. 2a). 

TAFs are largely depleted at TATA-box-containing promoters, 
although they are not entirely absent (Fig. 3a). The low level of Taf1 
that was present tended to be positioned similarly to TBP and other 
GTFs (Fig. 1b and Supplementary Fig. 2a, b). In contrast, 
Taf1-enriched/TATA-less promoters (which are related, as shown in 
Supplementary Fig. 3) showed additional interactions downstream 
of the TSS that exactly coincided with the size and location of the 
+1 nucleosome (Fig. 3a). Indeed, Taf1 displayed a more uniform 
positioning pattern in relation to the TSS and +1 nucleosome than 
to TATA elements, which suggests that at least part of the TFIID 
TAF complex engages and is positioned by the +1 nucleosome at 
TATA-less promoters. Consistent with this, Bdf1, which is considered 
to be a missing piece of Taf1 (ref. 30), binds to the +1 nucleosome”. 
Furthermore, Bdf1 showed a nearly identical ChIP-exo pattern to that 
of Taf1 (Supplementary Fig. 4). TFIID–nucleosomal interactions 
have also been reported in mammalian systems, although the details 
may differ’. 

If TFIID binds simultaneously to both the +1 nucleosome and a 
TATA element, then the intervening Pol II would seem to be fenced in 
by TFIID, thereby limiting its ability to scan DNA. This model pre- 
dicts that the TSS would reside closer to the TATA element and be 
positionally restricted relative to the +1 nucleosome, compared to 
Taf1-depleted/TATA-box-containing promoters. Indeed, the TSS at 
TATA-less promoters resided ~10–20 bp closer to the TATA element 
than at TATA-box-containing promoters (Supplementary Fig. 2a). 

We also compared the position of TATA elements and TSSs in 
relation to the +1 nucleosome. We separately examined individual 
Taf1-depleted/TATA-box-containing and Taf1-enriched/TATA-less 
promoters (Fig. 3b). Strikingly, at the Taf1-enriched promoters, 
the TSS was tightly positioned at the border between the 5’ NFR 
and the +1 nucleosome in comparison to Taf1-depleted/TATA-box-containing 
promoters. The latter had TSSs distributed across 
the adjacent nucleosome position, and these nucleosomes were relatively 
depleted compared to the Taf1-enriched class (Fig. 3a). 

Taken together, we interpret these observations as reflecting distinct 
functions of the +1 nucleosome at the two classes of genes. Nucleosomes 
and PICs of the TFIID-enriched/TATA-less class might cooperatively 
assemble, in which case the nucleosome may be instructive for TSS 
selection by impeding Pol II scanning (Fig. 3c). In contrast, nucleosomes 
and PICs of the Taf1-depleted/TATA-box-containing class may competitively 
assemble. This may allow for the greater stochasticity or plasticity 
of expression that is characteristic of this class’, in which 
nucleosome loss would prime the gene for a high level of transcription. 
A nucleosome competition mechanism removes an impediment to Pol 
II scanning. Pol II could scan further, thereby allowing productive ini- 
tiation at specific DNA elements”. The transition into a scanning state 
may be rate limiting in the scanning cycle, as the PIC is detected 
upstream of the TSS. 


GTF depletion in genes and their termini 


Although it is clear that GTFs assemble in promoter regions of mRNA 
genes and disengage Pol II within ~150 bp of the TSS**”, it is less 
Figure 3 | PIC organization in relation to TFIID and the +1 nucleosome. 
a, GTF occupancy around the nearest nucleosome position (essentially +1) to 
an mRNA PIC, which were sorted by the distance between the two. Unfiltered 
tags on each strand were shifted in the 3’ direction by a fixed distance (~8 bp 
depending on each GTF, 73 bp for nucleosomes), so as to reflect better the 
points of crosslinking. Taf1-depleted (Taf1-depl.) and Taf1-enriched genes 
were determined as being distinct clusters when GTF occupancies of all genes 
were clustered by k-means (see Fig. 5a). For all graphs of this type, image 


resolution is less than the number of rows, resulting in some averaging and thus 
the appearance of less variance across adjacent rows. See Supplementary Data 2 
for underlying values, which can be visualized in Treeview. RP indicates 
ribosomal protein genes. The right panel shows transcription frequency**. The 
nucleosome borders are denoted by vertical dashed black lines. b, Same as panel 
a, but showing an overlay of TATA elements, TFIIB and TSS. c, Model of PIC 
organization at TATA-box-containing and TATA-less/TFIID-dependent 
genes. 
clear to what extent they assemble across the body of genes, or at genes 
that are either transcriptionally silent or classified as ‘dubious’. A 
whole genome view of GTFs and Pol II is presented in Fig. 4a. 
Remarkably, PICs were almost entirely excluded from coding regions, 
regardless of gene activity, whereas Pol II was enriched across gene 
bodies, as expected. Approximately 90% of dubious open reading 
frames (ORFs) lacked a canonical PIC organization or contained PICs 
within the ORF, and thus are unlikely to be coding. Thus, coding region 
PIC-driven initiation, whether in the sense or antisense direction, is 
infrequent on the scale of what is seen at mRNA promoters. 
Moreover, the observed GTF pattern makes clear that Pol II disengages 
all GTFs at the promoter. 

Much less is known of the fate of Pol II at the ends of genes, as it 
undergoes termination. To examine the 3’ ends of genes, without the 
complications associated with nearby mRNA promoters, we sepa- 
rated 3’ ends into those having nearby 3’ or 5’ ends of an adjacent 
gene (Fig. 4b). Within terminal intergenic regions, GTFs were highly 


depleted, indicating that PICs rarely exist at the 3’ ends of genes at 
levels seen for mRNA genes (although lower levels do exist). 

Remarkably, we find a highly correlated enrichment of Pol II and 
TFIIH (Ssl2), but not TFIIK (Kin28), at the end of genes within 3’ 
NFRs (Fig. 4b). Such a physical separation of the TFIIH/Ssl2 and 
TFIIK/Kin28 submodules of holo-TFIIH in genome-wide binding 
experiments has not previously been reported, but may be in accord 
with their biochemistry’**'. However, Ssl2 is biochemically separable 
from the TFIIH core, which therefore prompted us to examine 
additional core TFIIH subunits, Ssl1 and Tfb1. Surprisingly, both 
were absent from the ends of genes (Supplementary Fig. 6), although 
they were present within PICs at promoters. These results suggest that 
Ssl2, a 3′–5′ helicase, operates independently of holo-TFIIH at the 
ends of genes. Consistent with a possible role of Ssl2 in transcription 
termination, Ssl2 has functional interactions with the Hsp90 protein 
chaperone**’, which has been implicated in the disassembly of the 
transcription machinery”. 
Figure 4 | Genomic view of PICs in relation to genes. a, GTF occupancy 
around transcript and ORF start sites’’, sorted by gene length. See 
Supplementary Data 3-5 for underlying values. Transcript or ORF ends are 
indicated by black dashed and solid lines, respectively. The right panel shows 
transcription frequency**. b, GTF distribution around the 3’ ends of mRNA 
genes, sorted by intergenic length, and sectioned by convergent versus tandem 
gene-pairs. Occupancy at eight reported looped genes”? is shown in 
Supplementary Fig. 5. c, GTF distribution around the TSS of mRNA genes, 
sorted by intergenic length, and sectioned by inverted versus tandem gene- 
pairs. 

Divergent transcription from distinct PICs 

In contrast to coding regions, PICs were abundant in intergenic 
regions, far beyond what could be accounted for at mRNA promoters 
(Fig. 4c). Divergent transcription, in which mRNA and non-coding 
(nc)RNA initiation occurs within the same region but elongates in 
opposite directions, is well established in eukaryotes*®*'. However, it 
has been unclear whether divergent transcription originates from the 
same PIC site. Conceivably, the entire PIC or a portion thereof might 
assemble in either direction. As shown in Fig. 4c, even the shortest 
(~120 bp) 5’ intergenic regions of mRNA genes with inverted orienta- 
tion were associated with two PICs, one for each mRNA direction. 
Thus, divergent mRNA transcription originates from two distinct 
PICs, even when arising from the same NFR. 

We next examined the composition and location of PICs associated 
with mRNA and ncRNA (variously classified as cryptic unstable tran- 
scripts (CUTs), stable uncharacterized transcripts (SUTs), Xrn1- 
sensitive unstable transcripts (XUTs) and antisense (AS))**“*. We 
also examined orphan PICs, which we defined as being >160 bp from 
any annotated TSS or ORF start site. Nearly all had the same relative 
composition of GTFs, including Taf1 depletion or enrichment 
(Fig. 5a), although mRNA PICs generally had higher occupancy 
levels. GTFs had highly correlated occupancies at all PICs (see Supplementary 
Fig. 7 for mRNA PICs). ncRNA PICs were generally 
organized around an adjacent nucleosome, as seen for mRNA PICs 
(Supplementary Fig. 8). Thus, all mRNA, ncRNA and orphan PICs 
are compositionally homogeneous with regard to the GTFs (excluding 
TAFs). 

To visualize better the context of the low-occupancy ncRNA PICs 
with mRNA genes, we marked ncRNA PIC locations by their TATA 
element, and plotted their directionality with respect to nearby 
mRNA (Fig. 5b). We observed a general trend where ncRNA and 
mRNA PICs were positioned in opposite directions 150-200 bp apart. 
This places both PICs within the same NFR, and thus within the same 
canonical nucleosome architecture, as seen for two divergent mRNA 
PICs that share the same NFR (Supplementary Fig. 9). 

Low-occupancy ncRNA PICs were also found towards the 3’ ends 
of mRNA genes, of which the majority were antisense to the mRNA 
(Fig. 5b), and associated with low expression of the sense mRNA 
Figure 5 | Distribution of ncRNA PICs. a, GTF occupancy levels at PICs for 
mRNA, ncRNA and orphans, sectioned by k-means clustering. Occupancy 
levels of each GTF were median normalized, log2 transformed, then sorted by 
row median. The small M1 group represents a terminating polymerase 
originating from upstream. b, Distribution of ncRNA PICs relative to mRNA 
genes. mRNA genes were filtered to retain only those having a nearby ncRNA- 
associated PIC (defined as having >10% of the genome-wide TFIIB average). 
Plotted are the locations of TATA elements associated with the mRNA (grey), 
sense-directed ncRNA (blue), and antisense-directed ncRNA (red). Additional 
plot details are as described in Fig. 4. 
(Supplementary Fig. 10a). Thus, ncRNAs, which tend to be antisense”, 
are generally associated with repression when residing in gene 
bodies. 

In total, we identified ~6,000 PICs in rapidly growing yeast cells, in 
which the PICs had an occupancy level of >10% of the genome 
average. More than 98% of these PICs had a TATA element precisely 
where TBP bound. Approximately 70% of the identified PICs were 
associated with mRNA genes (Supplementary Fig. 10b). The remain- 
ing ~30% were divided evenly between ncRNA and orphans. At 
lower detection thresholds, many more low-occupancy PICs could 
be identified. We do not believe that they represent technical noise, as 
they are highly enriched in NFRs in which mRNA and ncRNA PICs 
are found. They might produce low levels of promoter-specific basal 
transcription. 


Unifying principles of PICs 


Our data suggest that with the exception of TAFs, PICs are compositionally 
homogeneous with regard to GTFs at coding and non-coding 
Pol II transcription units in the yeast genome. PICs differ markedly in 
occupancy levels, which is in accord with their transcription fre- 
quency. PICs tend to form at NFR/nucleosome borders at the 5’ 
end (and to some extent at the 3’ end) of genes, where they direct 
either mRNA or ncRNA transcription away from the NFR. As such, 
an NFR may normally accommodate two divergently oriented PICs at 
markedly different occupancy and transcription levels. These occu- 
pancy levels do not strictly correlate, which suggests largely independ- 
ent control of the two PICs. 

PIC assembly, orientation and positioning may be contributed to in 
part by the resident TATA element, as well as through sequence- 
specific factors and co-factors. TFIID-regulated promoters may rely 
less on TATA element strength, and more on an NFR-adjacent 
nucleosome for PIC assembly, orientation and positioning. The adja- 
cent nucleosome might also serve to impede a scanning Pol II so that it 
can productively select a TSS at a focused position just inside the 
nucleosome. At SAGA-regulated promoters, which tend to contain 
a consensus TATA box, nucleosome occupancy may be more com- 
petitive with PIC assembly, wherein the strength of TBP/TATA inter- 
actions would be more important for PIC assembly, orientation and 
positioning. As such, there would be no nearby nucleosome to impede 
polymerase scanning, which would allow TSS selection to be con- 
trolled by other factors, including DNA sequence. 

The emergent concept of ncRNA and the difficulty of distinguishing 
random transcriptional noise from specific initiation raise the question 
as to what constitutes a gene*. The unambiguous and precise mapping 
of PIC locations across a genome, as described here, might help define 
the start of individual genes. 


METHODS SUMMARY 


Saccharomyces strains (BY4741) bearing TAP-tagged GTF or Pol II subunits (or 
untagged TBP) were grown to exponential phase in yeast extract peptone dextrose 
(YPD) media (30 °C to OD600 nm = 9.8), then subjected to 1% formaldehyde 
crosslinking for 15 min at 25 °C. Cells were harvested and washed. Sonicated 
chromatin was prepared by standard methods. Standard ChIP methods were 
used, followed by lambda exonuclease treatment and library construction, as 
described elsewhere”. Libraries were sequenced by an ABI SOLiD sequencer. 
Figures displaying strand-specific sequencing tags represent the raw data without 
normalization to input. TFIIB peak calls were made with GeneTrack software**. 
PICs (n = 6,045) were identified as having a TFIIB peak-pair in at least two out of 
four biological replicates and having ≥33 sequence tags (>10% of average TFIIB 
occupancy)”. PICs were assigned to the nearest TSS within ± 200 bp, with 
mRNA” having precedence over ncRNA. For this purpose, ORFs lacking a 
TSS (from the Saccharomyces Genome Database) were assigned a hypothetical 
TSS based on the genome-wide consensus. PICs of ncRNA were assigned to the 
nearest TSS within ± 200 bp of SUTs, CUTs, ASs and XUTs*™*, with SUTs/ 
CUTs having precedence over AS/XUTs. To assign directionality to orphan PICs, 
we compared nucleosome occupancy on the lower versus higher coordinate side 
of TFIIB locations. If the higher coordinate had higher nucleosome occupancy it 
was classified as ‘sense’, otherwise it was ‘antisense’. We validated this method by 
applying it to mRNA PICs, and found >91% of the assignments to be correct. We 
searched for TATA elements between −80 and +20 bp of the midpoint of 6,045 
TFIIB-bound locations on the sense strand, first by searching for the consensus 
TATAWAWR, then for one and then two mismatches to the consensus. The 
TATA element closest to —28 bp of a TFIIB peak had precedence if multiple 
elements were found. 
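
As an illustration of the assignment rule described in this summary (nearest TSS within ± 200 bp, with mRNA taking precedence over ncRNA), the following minimal sketch shows one possible reading. The function name, the dictionary layout and the coordinates are hypothetical; it assumes that precedence means a higher-ranked class wins whenever it has any TSS inside the window, even if a lower-ranked TSS is closer.

```python
def assign_pic(pic_pos, tss_by_class, window=200, precedence=("mRNA", "ncRNA")):
    """Assign a PIC to the nearest TSS within +/- `window` bp.

    Classes listed earlier in `precedence` are tried first, so an mRNA TSS
    inside the window wins over a closer ncRNA TSS (an assumed reading).
    Returns (class_name, tss_position), or None for an orphan PIC.
    """
    for cls in precedence:
        candidates = [t for t in tss_by_class.get(cls, [])
                      if abs(t - pic_pos) <= window]
        if candidates:
            return cls, min(candidates, key=lambda t: abs(t - pic_pos))
    return None


# Illustrative coordinates only.
annotations = {"mRNA": [900, 1150], "ncRNA": [1010]}
print(assign_pic(1000, annotations))
```

A PIC with no TSS of any class inside the window is returned as `None`, which loosely corresponds to the orphan category, although the text defines orphans with its own distance cutoff.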

Creating, moving and merging Dirac points with a 
Fermi gas in a tunable honeycomb lattice 


Leticia Tarruell, Daniel Greif, Thomas Uehlinger, Gregor Jotzu & Tilman Esslinger 


Dirac points are central to many phenomena in condensed-matter 
physics, from massless electrons in graphene to the emergence of 
conducting edge states in topological insulators’’. At a Dirac 
point, two energy bands intersect linearly and the electrons behave 
as relativistic Dirac fermions. In solids, the rigid structure of the 
material determines the mass and velocity of the electrons, as well 
as their interactions. A different, highly flexible means of studying 
condensed-matter phenomena is to create model systems using 
ultracold atoms trapped in the periodic potential of interfering 
laser beams**. Here we report the creation of Dirac points with 
adjustable properties in a tunable honeycomb optical lattice. Using 
momentum-resolved interband transitions, we observe a minimum 
bandgap inside the Brillouin zone at the positions of the two Dirac 
points. We exploit the unique tunability of our lattice potential to 
adjust the effective mass of the Dirac fermions by breaking inver- 
sion symmetry. Moreover, changing the lattice anisotropy allows us 
to change the positions of the Dirac points inside the Brillouin zone. 
When the anisotropy exceeds a critical limit, the two Dirac points 
merge and annihilate each other—a situation that has recently 
attracted considerable theoretical interest’ but that is extremely 
challenging to observe in solids’®. We map out this topological 
transition in lattice parameter space and find excellent agreement 
with ab initio calculations. Our results not only pave the way to 
model materials in which the topology of the band structure is 
crucial, but also provide an avenue to exploring many-body phases 
resulting from the interplay of complex lattice geometries with 
interactions’. 

Ultracold gases in optical lattices have become a versatile tool with 
which to simulate a wide range of condensed-matter phenomena”. 
For example, the control of interactions has led to the observation of 
Mott insulating phases’*’*. In fermionic systems, this provides new 
access to the physics of strongly correlated materials. However, the 
topology of the band structure is equally important for the properties of 
a solid. A prime example is the honeycomb lattice of graphene, where 
the presence of topological defects in momentum space—the Dirac 
points—leads to remarkable transport properties, even in the absence 
of interactions’. In quantum gases, a honeycomb lattice has recently 
been realized and investigated using a Bose-Einstein condensate’, 
but no signatures of Dirac points were observed. Here we study an 
ultracold Fermi gas of 40K atoms in a two-dimensional, tunable optical 
lattice, which can be continuously adjusted to create square, triangular, 
dimer and honeycomb structures. In the honeycomb lattice, we 
identify the presence of Dirac points in the band structure by observing 
a minimum bandgap inside the Brillouin zone using interband transi- 
tions. Our method is closely related to a technique recently used with 
bosonic atoms to characterize the linear crossing of two high-energy 
bands in a one-dimensional, bichromatic lattice'’, but also provides 
momentum resolution. 

To create and manipulate Dirac points, we have developed a two- 
dimensional optical lattice of adjustable geometry. It is formed by three 
retro-reflected laser beams of wavelength λ = 1,064 nm, arranged as 
depicted in Fig. 1a. The interference of two perpendicular beams, X 
and Y, gives rise to a chequerboard lattice of spacing λ/√2. A third 
beam, X̄, collinear with X but detuned by a frequency δ, creates an 
additional standing wave with a spacing of λ/2. This yields a potential 
of the form 


Figure 1 | Optical lattice with adjustable geometry. a, Three retro-reflected 
laser beams of wavelength λ = 1,064 nm create the two-dimensional lattice 
potential of equation (1). Beams X and Y interfere and produce a chequerboard 
pattern, and beam X̄ creates an independent standing wave. Their relative 
position is controlled by the detuning δ. b, Top: different lattice potentials can 
be realized depending on the intensities of the lattice beams. White regions 
correspond to lower potential energies and blue regions to higher potential 
energies. Bottom: diagram showing the accessible lattice geometries as a 
function of the lattice depths V_X̄ and V_X. The transition between triangular (T) 
and dimer (D) lattices is indicated by a dotted line. When crossing the dashed 
line into the honeycomb (Hc) regime, Dirac points appear. The limit Vg>>Vx, 
Vx>>Vy corresponds to weakly coupled, one-dimensional chains (1D c). c, The 
real-space potential of the honeycomb lattice has a two-site unit cell (sites A and 
B) and the primitive lattice vectors are perpendicular. d, Left: sketch of the first 
and second Brillouin zones (BZs) of the honeycomb lattice, indicating the 
positions of the Dirac points. Right: three-dimensional view of the energy 
spectrum showing the linear intersection of the bands at the two Dirac points. 
The colour scale illustrates lines of constant energy. We denote the full 
bandwidth, W; the minimum energy gap at the edges of the Brillouin zone, E_G; 
and the Bloch wavevector, q_B = 2π/λ. 

$$V(x,y) = -V_{\bar{X}}\cos^{2}(kx+\theta/2) - V_{X}\cos^{2}(kx) - V_{Y}\cos^{2}(ky) - 2\alpha\sqrt{V_{X}V_{Y}}\,\cos(kx)\cos(ky)\cos\varphi \qquad (1)$$



where V_X̄, V_X and V_Y denote the single-beam lattice depths (proportional 
to the laser beam intensities), α is the visibility of the interference 
pattern and k = 2π/λ. We can adjust the two phases continuously, and 
choose θ = π and φ = 0 (Methods). Varying the relative intensities of 
the beams allows us to realize various lattice structures (Fig. 1b). In the 
following, we focus on the honeycomb lattice, whose real-space poten- 
tial is shown in Fig. 1c. 
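
To make the role of each term concrete, here is a minimal sketch that evaluates the lattice potential point by point. It assumes the form V(x, y) = −V_X̄ cos²(kx + θ/2) − V_X cos²(kx) − V_Y cos²(ky) − 2α√(V_X V_Y) cos(kx) cos(ky) cos φ with k = 2π/λ. The function and argument names are hypothetical, and the visibility value in the usage line is an illustrative assumption, since the text does not quote one here.

```python
import math


def lattice_potential(x, y, v_xbar, v_x, v_y, alpha,
                      theta=math.pi, phi=0.0, wavelength=1.0):
    """Potential at (x, y), in the same energy units as the lattice depths.

    Lengths are in units of `wavelength`; theta = pi and phi = 0 are the
    phase choices quoted in the text.
    """
    k = 2 * math.pi / wavelength
    return (-v_xbar * math.cos(k * x + theta / 2) ** 2
            - v_x * math.cos(k * x) ** 2
            - v_y * math.cos(k * y) ** 2
            - 2 * alpha * math.sqrt(v_x * v_y)
            * math.cos(k * x) * math.cos(k * y) * math.cos(phi))


# Depths in recoil energies; alpha = 0.9 is an assumed, illustrative visibility.
print(lattice_potential(0.0, 0.0, v_xbar=4.0, v_x=0.28, v_y=1.8, alpha=0.9))
```

Scanning x and y over one wavelength with different depth ratios gives a feel for how the relative beam intensities select the different lattice geometries.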

The honeycomb lattice consists of two sublattices, A and B. Therefore, 
the wavefunctions are two-component spinors’. Tunnelling between 
the sublattices leads to the formation of two energy bands, which are 
well separated from the higher bands and have a conical intersection at 
two quasi-momentum points in the Brillouin zone—the Dirac points. 
These points are topological defects in the band structure, with 
respective associated Berry phases of π and −π. This guarantees their 
stability with respect to lattice perturbations, such that a large range of 
lattice anisotropies change only the positions of the Dirac points inside 
the Brillouin zone. In contrast, breaking the inversion symmetry of the 
potential by introducing an energy offset, Δ, between the sublattices 
opens an energy gap at the Dirac points, proportional to Δ. In our 
implementation, Δ depends only on the value of the phase θ and can be 
precisely adjusted (Methods). As shown in Fig. 1c, d, the primitive 
lattice vectors are perpendicular, leading to a square Brillouin zone 
with two Dirac points inside. Their positions are symmetric around the 
centre and are fixed to quasi-momentum q_y = 0, owing to the time- 
reversal and reflection symmetries of the system”. The band structure 
for our lattice implementation is in the two lowest bands topologically 
equivalent to that of a hexagonal lattice with six-fold symmetry. For 
deep lattices, both configurations then also map to the same tight- 
binding Hamiltonian. 
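
The tight-binding picture mentioned above can be explored with a small numerical sketch. It assumes a generic two-band nearest-neighbour model in which each A site couples to three B sites, with hopping t′ along one bond and t along the other two, so that the bands are E = ±|f| with f(u, v) = t′ + t e^{iu} + t e^{iv} and u = q·a₁, v = q·a₂. This is a textbook-style toy model with hypothetical parameter names, not the ab initio band-structure calculation used by the authors.

```python
import cmath
import math


def min_band_gap(t_prime, t, n=300):
    """Smallest splitting 2|f| between the two bands on an n x n grid.

    u and v are the phases q.a1 and q.a2, each sampled over one full period.
    A (numerically) vanishing result signals a pair of Dirac points.
    """
    best = float("inf")
    for i in range(n):
        u = 2 * math.pi * i / n
        for j in range(n):
            v = 2 * math.pi * j / n
            f = t_prime + t * cmath.exp(1j * u) + t * cmath.exp(1j * v)
            best = min(best, 2 * abs(f))
    return best


for ratio in (1.0, 2.0, 2.5):
    print(ratio, min_band_gap(ratio, 1.0))
```

In this toy model the gap stays closed for t′ ≤ 2t and opens as 2(t′ − 2t) beyond that point, which is the tight-binding counterpart of two Dirac points moving together, merging and annihilating as the anisotropy grows.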

We characterize the Dirac points by probing the energy splitting 
between the two lowest-energy bands through interband transitions. 
The starting point of the experiment is a non-interacting, ultracold gas 
of N ≈ 50,000 fermionic 40K atoms in the |F, m_F⟩ = |9/2, −9/2⟩ state, 
where F denotes the hyperfine manifold and m_F the Zeeman state. The 
cloud is prepared in the lowest-energy band of a honeycomb lattice 
with V_X̄/E_R = 4.0(2), V_X/E_R = 0.28(1) and V_Y/E_R = 1.8(1), which 
also causes a weak harmonic confinement with trapping frequencies 
ω_x/2π = 17.6(1) Hz, ω_y/2π = 31.8(5) Hz and ω_z/2π = 32.7(5) Hz. 
Here E_R = h²/2mλ² is the recoil energy, h denotes Planck’s constant 
and m is the mass of a 40K atom. Throughout the manuscript, errors in 
parentheses denote the standard deviation. On application of a weak 
magnetic field gradient, the atomic cloud is subjected to a constant 
force, F, in the x direction, with an effect equivalent to that produced by 
an electric field in solid-state systems. The atoms are hence accelerated 
such that their quasi-momentum q_x increases linearly up to the edge of 
the Brillouin zone, where a Bragg reflection occurs. The cloud even- 
tually returns to the centre of the band, performing one full Bloch 
oscillation®’. We then measure the quasi-momentum distribution of 
the atoms in the different bands** (Methods). 

Owing to the finite momentum width of the cloud, trajectories with 
different quasi-momenta q_y are simultaneously explored during the 
Bloch cycle (Fig. 2a). For a trajectory far from the Dirac points, the 
atoms remain in the lowest-energy band (trajectory 1). In contrast, 
when passing through a Dirac point (trajectory 2), the atoms are 
transferred from the first band to the second because of the vanishing 
energy splitting at the linear band crossing. When measuring the 
quasi-momentum distribution, these atoms are missing in the first 
Brillouin zone and appear in the second band (Fig. 2a). We identify 
the points of maximum transfer with the Dirac points. The energy 
resolution of the method is set by the characteristic energy of the 
applied force”, E_B/h = Fλ/2h = 88.6(7) Hz, which is small compared 

Figure 2 | Probing the Dirac points. a, Quasi-momentum distribution of the 
atoms before and after one Bloch oscillation, of period T_B (colour scale, column 
density of the absorption image in arbitrary units). The cloud explores several 
trajectories in quasi-momentum space simultaneously. For trajectory 1 (blue 
filled circle), the atoms remain in the first energy band. In contrast, trajectory 2 
(green open circle) passes through a Dirac point at t = T_B/2. There the energy 
splitting between the bands vanishes and the atoms are transferred to the 
second band. When measuring the quasi-momentum distribution at t = T_B, 
these atoms are missing from the first Brillouin zone and appear in the second 
one. b, Dependence of the total fraction of atoms transferred to the second 
band, ξ, on the detuning, δ, of the lattice beams, which controls the sublattice 
energy offset, Δ. The maximum indicates the point of inversion symmetry, 
where Δ = 0 (θ = π in equation (1)) and the gap at the Dirac point vanishes. 
Insets: away from the peak, the atoms behave as Dirac fermions with a tunable 
mass. Data show mean ± s.d. of five consecutive measurements; solid line is a 
Gaussian fit to the data. 


with the full bandwidth, W/h = 4.6 kHz, and the minimum bandgap at 
the edges of the Brillouin zone, E_G/h = 475 Hz. 
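
As a rough numerical check of these energy scales, the sketch below evaluates the recoil frequency E_R/h = h/(2mλ²) for ⁴⁰K at the 1,064 nm lattice wavelength given in the Methods, and converts the quoted E_B/h into a Bloch period. It assumes that the Bloch period along x is T_B = h/E_B, which follows if the band structure repeats after a quasi-momentum change of 2h/λ along q_x; the function names are illustrative.

```python
H = 6.62607015e-34        # Planck constant (J s)
AMU = 1.66053906660e-27   # atomic mass unit (kg)
M_K40 = 39.964 * AMU      # mass of a 40K atom (kg)
WAVELENGTH = 1064e-9      # lattice wavelength quoted in the Methods (m)


def recoil_frequency(mass=M_K40, wavelength=WAVELENGTH):
    """Recoil energy divided by h, E_R/h = h / (2 m lambda^2), in Hz."""
    return H / (2 * mass * wavelength ** 2)


def bloch_period(bloch_frequency_hz):
    """Bloch period in seconds, assuming T_B = h / E_B = 1 / (E_B/h)."""
    return 1.0 / bloch_frequency_hz


if __name__ == "__main__":
    e_b = 88.6     # E_B/h in Hz (from the text)
    width = 4.6e3  # full bandwidth W/h in Hz (from the text)
    e_g = 475.0    # minimum band-edge gap E_G/h in Hz (from the text)
    print(f"E_R/h   = {recoil_frequency():.0f} Hz")
    print(f"T_B     = {1e3 * bloch_period(e_b):.2f} ms")
    print(f"E_B/W   = {e_b / width:.3f}")
    print(f"E_B/E_G = {e_b / e_g:.3f}")
```

The two ratios make the claim quantitative: the force scale E_B is a few per cent of the bandwidth and under a fifth of the band-edge gap.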

To investigate how breaking the inversion symmetry of the lattice 
affects the Dirac points, we vary the sublattice offset, Δ, which is 
controlled by the frequency detuning, δ, between the lattice beams, 
and measure the total fraction of atoms transferred to the second band, 
ξ. The results obtained for a honeycomb lattice with V_X̄/E_R = 3.6(2), 
V_X/E_R = 0.28(1) and V_Y/E_R = 1.8(1) are displayed in Fig. 2b and 
show a sharp maximum in the transferred fraction. We identify this 
situation as the point of inversion symmetry, where Δ = 0 (θ = π), in 
good agreement with an independent calibration (Methods). At this 
setting, the bandgap at the Dirac points vanishes. The population in 
the second band decreases symmetrically on both sides of the peak as 
the gap increases, indicating the transition from massless to massive 
Dirac fermions. 
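
One way to picture why the transferred fraction falls off symmetrically with the gap is a Landau-Zener model of a single avoided crossing. This model is an assumption of the sketch and is not spelled out in the text above. For a linear sweep through a crossing with minimum gap Δ, the probability of ending in the upper band is P = exp(−πΔ²/(2ħα)), where α is the rate of change of the uncoupled energy difference. Written with the gap as a frequency ν = Δ/h and the sweep rate r = α/h, this becomes P = exp(−π²ν²/r). The sweep rate used below is an illustrative placeholder, not a measured value.

```python
import math


def lz_transfer_probability(gap_hz, sweep_rate_hz_per_s):
    """Landau-Zener probability of transfer to the upper band.

    gap_hz: minimum band splitting divided by h (Hz).
    sweep_rate_hz_per_s: rate of change of the uncoupled energy
        difference divided by h (Hz per second).
    """
    return math.exp(-math.pi ** 2 * gap_hz ** 2 / sweep_rate_hz_per_s)


if __name__ == "__main__":
    sweep_rate = 5.0e5  # Hz/s, hypothetical value chosen for illustration
    for gap in (0.0, 50.0, 100.0, 200.0):
        p = lz_transfer_probability(gap, sweep_rate)
        print(f"gap = {gap:5.0f} Hz  ->  transfer probability = {p:.3f}")
```

In this picture the transfer is complete for a vanishing gap and is a Gaussian function of the gap, which is at least consistent with the Gaussian fit used in Fig. 2b.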

The relative strength of the tunnel couplings between the different 
sites of the lattice fixes the position of the Dirac points inside the 
Brillouin zone, as well as the slope of the associated linear dispersion 
relation®°. However, the tunability of our optical lattice structure 
allows for independent adjustment of the tunnelling parameters in 
the x and y directions simply by controlling the intensity of the laser 
beams. For isotropic tunnellings, the slope of the dispersion relation 
around the Dirac points is the same in all directions, but is anisotropic 
otherwise. The distance from the Dirac points to the corners of the 
Brillouin zone along q_y can be varied between 0 and q_B/2, whereas 
q_x = 0 is fixed by reflection symmetry”. Here q_B = 2π/λ denotes the 
Bloch wave vector. 

We exploit the momentum resolution of the interband transitions 
directly to observe the movement of the Dirac points. Starting from a 
honeycomb lattice with V_X̄/E_R = 5.4(3), V_X/E_R = 0.28(1) and 
V_Y/E_R = 1.8(1), we gradually increase the tunnelling in the x direction 
by decreasing the intensity of X̄. The position of the Dirac points 
continuously approaches the corners of the Brillouin zone (Fig. 3), 
as expected from an ab initio two-dimensional band structure calcula- 
tion (Methods). The deviations close to the merging point are possibly 

Figure 3 | Movement of the Dirac points. a, Distance from the Dirac points to 
the corners of the Brillouin zone, as measured through momentum-resolved 
interband transitions. The tunnelling in the x direction increases when the 
lattice depth V_X̄ is decreased. The distance is extracted from the second-band 
quasi-momentum distribution after one Bloch cycle (insets). The merging of 
the two Dirac points at the corners of the Brillouin zone is signalled by a single 
line of missing atoms in the first band. Data show mean ± s.d. of three to nine 
measurements. The solid line is the prediction of a two-dimensional band 
structure calculation without any fitting parameters. b, Energy splitting 
between the two lowest bands. It shows the displacement of the Dirac cones 
inside the Brillouin zone, as well as their deformation depending on the lattice 
depth V_X̄. 

caused by the flattening of the dispersion relation between the two 
Dirac points as they approach each other’. 

When they reach the corners of the Brillouin zone, the two Dirac 
points merge, annihilating each other. There the dispersion relation 
becomes quadratic along the q_y axis, remaining linear along q_x. Beyond 
this critical point, a finite bandgap appears for all quasi-momenta of 
the Brillouin zone. This situation signals the transition between band 
structures of two different topologies, one containing two Dirac points 
and the other containing none. For two-dimensional honeycomb lattices 
at half-filling, it corresponds to a Lifshitz phase transition from a 
semimetallic phase to a band-insulating phase®’. 

We experimentally map out the topological transition line by 
recording the fraction of atoms transferred to the second band, ξ, 
as a function of the lattice depths V_X̄ and V_X, while keeping 
V_Y/E_R = 1.8(1). The results are shown in Fig. 4a. There the onset of 
population transfer to the second band signals the appearance of Dirac 
points in the band structure of the lattice. For a given value of V_X, the 
transferred fraction, ξ, decreases again for large values of V_X̄, as the 
Dirac points lie beyond the momentum width of the cloud. 

To extend the range of our measurements and probe the Dirac 
points even in this region, we apply a force in the y direction. We 
hence explore a new class of trajectories in quasi-momentum space. 
This allows for the investigation of very anisotropic Dirac cones, which 
become almost flat in the q_x direction as we approach the crossover to a 
one-dimensional lattice structure (V_X̄ ≫ V_X). Along the q_y trajectories, 
the centre of the cloud successively passes the two Dirac points during 
the Bloch cycle, effectively realizing a Stückelberg interferometer” in 
a two-dimensional band structure. As shown in Fig. 4b, we again 
identify the topological transition by the onset of population transfer 
to the second band. The results for the transition lines obtained for the 

Figure 4 | Topological transition. Fraction of atoms transferred to the second 
band, ξ, as a function of lattice depths V_X̄ and V_X, with V_Y/E_R = 1.8(1). 
Different lattice geometries (square, chequerboard, triangular, dimer and 
honeycomb) are realized (Fig. 1b). We consider trajectories in 
quasi-momentum space in the q_x (a) and q_y (b) directions. Each data point is a single 
measurement, and as a result there are at least 1,200 points per diagram. To 
maximize the transfer for the q_y trajectories, where the cloud successively passes 
the two Dirac points, we set θ = 1.013(1)π. For both trajectories, the onset of 
population transfer to the second band signals the topological transition, where 
the Dirac points appear. The dashed line is the theoretical prediction for the 
transition line without any fitting parameters, and the dotted line indicates the 
transition from the triangular lattice to the dimer lattice. The bottom diagrams 
show cuts of the band structure along the q_x axis (q_y = 0; a) and q_y axis (q_x = 0; 
b) for the values of V_X̄ and V_X indicated. 

two measurement series are in excellent agreement with an ab initio 
band structure calculation. At the transition, the transfer is expected to 
be lower by a factor of two along the central q_y trajectory, as compared 
to the central q_x trajectory. This is caused by the dispersion relation at 
the corners of the Brillouin zone being quadratic in the q_y direction and 
linear in q_x. For the data points on the (dashed) transition line in Fig. 4, 
we find an average ratio for the transfer along q_y and q_x of ξ_qy/ξ_qx = 0.52(14), 
in good agreement with the simple model. 

In this work, we have realized Dirac points with highly tunable 
properties using ultracold fermionic atoms in a honeycomb optical 
lattice. A new class of physical phenomena based on complex lattice 
topologies is now within the domain of quantum gas experiments. The 
versatility and novel observables of these systems will provide new 
insights. For example, the unique coherence of quantum gases offers 
the possibility of directly measuring, by interferometric methods, the 
Berry phase associated with the Dirac points. Topological order could 
be achieved by introducing artificial gauge fields, either through Raman 
transitions” or time-dependent lattice modulation”®. Moreover, the 
highly tunable lattice potential we have developed opens many new 
avenues for optical lattice experiments. For spin mixtures with repulsive 
interactions, the dynamic transition between dimer and square lattices 
should facilitate the adiabatic preparation of an antiferromagnetic 
phase” and allow the study of quantum criticality**. Additionally, the 
triangular and honeycomb lattices provide the possibility of exploring 
magnetic frustration and spin liquid phases'*”’. 


METHODS SUMMARY 


To prepare a non-interacting, quantum-degenerate Fermi gas of ⁴⁰K atoms, we 
perform evaporative cooling of a balanced spin mixture of the |F, m_F⟩ = |9/2, −9/2⟩ 
and |9/2, −7/2⟩ states in an optical dipole trap at a magnetic field of 197.6(1) G. 
After reaching typical temperatures of 0.2T_F, where T_F is the Fermi temperature, we 
apply a magnetic field gradient in the y direction to remove the |9/2, −7/2⟩ 
component while levitating the |9/2, −9/2⟩ atoms. The polarized Fermi gas is 
loaded into the optical lattice, where the dipole trap is switched off. We then apply 
an additional magnetic field gradient in either the x or the y direction, to excite 
Bloch oscillations. The two-dimensional lattice lies in the x-y plane, whereas in the 
z direction the atoms are harmonically trapped. The time phase, φ, between the 
interfering beams X and Y in equation (1) is stabilized interferometrically at a value 
of φ/π = 0.00(3). The measured visibility of the interference pattern is α = 0.90(5). 
The phase θ depends on the optical path length between the atomic cloud and the 
retro-reflecting mirror, and can be precisely controlled through the detuning, δ, 
between beams X and X̄. We infer the precise value θ/δ = π/384.7(6) MHz⁻¹ from 
the peak position in Fig. 2b. This is in good agreement with an independent 
calibration obtained using Raman-Nath diffraction on a ⁸⁷Rb Bose-Einstein 
condensate, which yields θ/δ = π/388(4) MHz⁻¹. The theory curves are extracted 
from an ab initio single-particle, two-dimensional, numerical band structure 
calculation for the homogeneous system. It therefore also takes into account 
higher-order tunnelling terms, which are relevant for the regime studied here. 
In particular, they cause an asymmetry between the two lowest bands and lead 
to a tilt of the Dirac cones in certain parameter regimes. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Preparation scheme. After sympathetic cooling with ⁸⁷Rb in a magnetic trap, 
2 × 10⁶ fermionic ⁴⁰K atoms are transferred into a dipole trap operating at a 
wavelength of 826 nm. A balanced spin mixture of the |F, m_F⟩ = |9/2, −9/2⟩ and 
|9/2, −7/2⟩ states is evaporatively cooled in a magnetic field of 197.6(1) G, in the 
vicinity of the Feshbach resonance at 202.1 G. We obtain typical temperatures of 
0.2T_F. The field is subsequently reduced and a magnetic field gradient is used to 
remove the |9/2, −7/2⟩ component while levitating the |9/2, −9/2⟩ atoms. This 
polarized Fermi gas is loaded into the two-dimensional optical lattice in 200 ms, 
and the dipole trap is then switched off. 

Tunable optical lattice. The optical lattice is produced by the combination of 
three retro-reflected laser beams of linear polarization and wavelength 
λ = 1,064 nm, red-detuned with respect to the D₁ and D₂ lines of ⁴⁰K. Beams X 
and X̄ propagate along the same axis and beam Y propagates at 90.0(1)° to that 
axis. The interference of X and Y produces a chequerboard potential of the form” 


$$V(x, y) = -V_X \cos^2(kx) - V_Y \cos^2(ky) - 2\alpha \sqrt{V_X V_Y}\, \cos(kx) \cos(ky) \cos\varphi$$


The phase between the two beams at the position of the atoms is stabilized 
interferometrically at a value of φ/π = 0.00(3) using a pair of additional beams detuned 
from each other and from X, X̄ and Y. This results in a weak additional lattice along 
each axis of about 0.1E_R. We use a heterodyne locking technique”’, where the two 
interfering arms are formed by the optical lattice beam paths and are each 
compared to a common reference beam. 

Beam X̄ creates a potential V_X̄(x) = −V_X̄ cos²(kx + θ/2), where the phase θ 
determines the relative position of the chequerboard pattern and the 
one-dimensional standing wave. We control the value of θ in the centre of the cloud 
by adjusting the frequency detuning, δ, between X and X̄. We infer the precise 
value θ/δ = π/384.7(6) MHz⁻¹ from the peak position in Fig. 2b. This is in good 
agreement with an independent calibration obtained using Raman-Nath 
diffraction on a ⁸⁷Rb Bose-Einstein condensate, which yields θ/δ = π/388(4) MHz⁻¹. At 
the edges of the cloud, the phase differs by approximately ±10⁻⁴π. 
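
The calibration above amounts to a linear conversion between detuning and phase. The short sketch below applies it, assuming θ is strictly proportional to δ (as the quoted ratio θ/δ implies) and using the central calibration value without its uncertainty; the function names are illustrative.

```python
import math

# Detuning (MHz) at which theta = pi, from the peak position in Fig. 2b.
CALIBRATION_MHZ = 384.7


def theta_from_detuning(delta_mhz, calibration_mhz=CALIBRATION_MHZ):
    """Phase theta (radians) for a detuning delta between beams X and X-bar."""
    return math.pi * delta_mhz / calibration_mhz


def detuning_for_theta(theta_over_pi, calibration_mhz=CALIBRATION_MHZ):
    """Detuning (MHz) required to reach a phase theta, given as theta/pi."""
    return theta_over_pi * calibration_mhz


if __name__ == "__main__":
    # Inversion-symmetric point and the setting used for the q_y trajectories.
    for target in (1.0, 1.013):
        print(f"theta = {target:.3f} pi  ->  detuning = {detuning_for_theta(target):.1f} MHz")
```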


The total lattice potential is given by V(x, y) + V_X̄(x) and, depending on the 
relative intensities of the beams, gives rise to square, chequerboard, triangular and 
honeycomb lattices, as well as to a staggered arrangement of dimers and an array of 
weakly coupled, one-dimensional chains. 
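
To make the role of θ concrete, the sketch below evaluates the total potential, the chequerboard term from X and Y plus the standing wave from X̄, in units of E_R with φ = 0 and α = 0.90. It then compares the depths of the two wells along the line y = 0. The well-depth difference is only a proxy for the sublattice offset Δ of the band structure, and the helper names and grid resolution are assumptions of this illustration.

```python
import numpy as np


def lattice_potential(kx, ky, v_xbar, v_x, v_y, theta=np.pi, phi=0.0, alpha=0.90):
    """Total potential in units of E_R; kx and ky are the phases k*x and k*y."""
    chequerboard = (
        -v_x * np.cos(kx) ** 2
        - v_y * np.cos(ky) ** 2
        - 2 * alpha * np.sqrt(v_x * v_y) * np.cos(kx) * np.cos(ky) * np.cos(phi)
    )
    standing_wave = -v_xbar * np.cos(kx + theta / 2) ** 2
    return chequerboard + standing_wave


def well_depth_difference(theta, v_xbar=4.0, v_x=0.28, v_y=1.8, n=20001):
    """Depth of the well at kx > 0 minus that at kx < 0, along ky = 0.

    Intended for theta close to pi, where one well lies on each side of kx = 0.
    """
    kx = np.linspace(-np.pi, np.pi, n)
    v = lattice_potential(kx, 0.0, v_xbar, v_x, v_y, theta=theta)
    return v[kx > 0].min() - v[kx < 0].min()


if __name__ == "__main__":
    for theta_over_pi in (0.95, 1.0, 1.05):
        diff = well_depth_difference(theta_over_pi * np.pi)
        print(f"theta = {theta_over_pi:.2f} pi  ->  well depth difference = {diff:+.3f} E_R")
```

At θ = π the two wells are degenerate, and detuning θ to either side tilts them in opposite directions, in line with the statement that Δ depends only on θ.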

The visibility, α = 0.90(5), and the lattice depths, V_X̄, V_X and V_Y, are calibrated 
using Raman-Nath diffraction. The method has a systematic uncertainty of 10% 
for the lattice depths; the statistical uncertainties are given in the main text. The 
two-dimensional lattice lies in the x-y plane, whereas in the z direction the atoms 
are harmonically trapped. Owing to the absence of interactions, the z direction 
decouples. 

The underlying trap frequencies in our system scale with the lattice depths 
according to the approximate expressions ω_x ∝ √V_Y, ω_y ∝ √(V_X̄ + V_X V_Y/V_X̄) 
and ω_z ∝ √(V_X̄ + V_X V_Y/V_X̄ + 1.24V_Y). For the parameters used in Fig. 2a, we find 
that ω_x/2π = 17.6(1) Hz, ω_y/2π = 31.8(5) Hz and ω_z/2π = 32.7(5) Hz, as 
calibrated from dipole oscillations of the cloud. 

Detection. The quasi-momentum distribution of the gas is probed using a 
band-mapping technique. The optical lattice beams are linearly ramped down in 500 µs, 
that is, slowly enough for the atoms to stay adiabatically in their band while quasi- 
momentum is approximately conserved”. We then allow for 15 ms of ballistic 
expansion before making an absorption image of the cloud. 

Band structure calculations. The energy spectrum is obtained using an ab initio 
single-particle, two-dimensional, numerical band structure calculation for the 
homogeneous system. It therefore also takes into account higher-order tunnelling 
terms, which are relevant for the regime studied here. In particular, they cause an 
asymmetry between the two lowest bands and lead to a tilt of the Dirac cones in 
certain parameter regimes. 


Designer Dirac fermions and topological phases in molecular graphene 


Kenjiro K. Gomes, Warren Mar, Wonhee Ko, Francisco Guinea & Hari C. Manoharan 


The observation of massless Dirac fermions in monolayer graphene 
has generated a new area of science and technology seeking to 
harness charge carriers that behave relativistically within solid-state 
materials’. Both massless and massive Dirac fermions have been 
studied and proposed in a growing class of Dirac materials that 
includes bilayer graphene, surface states of topological insulators 
and iron-based high-temperature superconductors. Because the 
accessibility of this physics is predicated on the synthesis of new 
materials, the quest for Dirac quasi-particles has expanded to 
artificial systems such as lattices comprising ultracold atoms**. 
Here we report the emergence of Dirac fermions in a fully tunable 
condensed-matter system—molecular graphene—assembled by 
atomic manipulation of carbon monoxide molecules over a conven- 
tional two-dimensional electron system at a copper surface’. Using 
low-temperature scanning tunnelling microscopy and spectro- 
scopy, we embed the symmetries underlying the two-dimensional 
Dirac equation into electron lattices, and then visualize and shape 
the resulting ground states. These experiments show the existence 
within the system of linearly dispersing, massless quasi-particles 
accompanied by a density of states characteristic of graphene. We 
then tune the quantum tunnelling between lattice sites locally to 
adjust the phase accrual of propagating electrons. Spatial texturing 
of lattice distortions produces atomically sharp p-n and p-n-p 
junction devices with two-dimensional control of Dirac fermion 
density and the power to endow Dirac particles with mass**. 
Moreover, we apply scalar and vector potentials locally and globally 
to engender topologically distinct ground states and, ultimately, 
embedded gauge fields”’, wherein Dirac electrons react to ‘pseudo’ 
electric and magnetic fields present in their reference frame but 
absent from the laboratory frame. We demonstrate that Landau 
levels created by these gauge fields can be taken to the relativistic 
magnetic quantum limit, which has so far been inaccessible in 
natural graphene. Molecular graphene provides a versatile means 
of synthesizing exotic topological electronic phases in condensed 
matter using tailored nanostructures. 

The Dirac fermion has emerged as a common feature of new materials 
whose band structure and embedded spin degree of freedom are 
described by the relativistic Dirac equation’. In two dimensions, it 
has long been appreciated theoretically that the honeycomb lattice 
and mapping of the sublattice degree of freedom to a pseudospin is 
represented by the Dirac equation for electrons bound to the lattice, 
H̃ = ħc̃σ·k, where k is the vector momentum, σ = (σ_x, σ_y) and σ_i are 
the 2 × 2 Pauli spin matrices coupled to pseudospin, c̃ is the Dirac 
fermion velocity (the effective speed of light) and ħ is Planck’s constant 
divided by 2π. The simplest model for this physics is captured in the 
two-site tight-binding band structure characterized by a hopping- 
matrix element, t, describing the bond strength between nearest- 
neighbour atoms separated by a distance a. Although this model is 
realized naturally in graphene, where carbon atoms comprise the 


honeycomb structure, it is more general and can describe any atoms 
in the same lattice, and even electrons alone, subject to the same sym- 
metries but devoid of atomic ‘containers’. However, tunable artificial 
Dirac fermions in honeycomb lattices have not been found in either 
atomic** or condensed-matter’*"* systems. 

In this work, we show that the synthesis and control of Dirac fermions 
in a general solid-state material is readily possible and can be interpreted 
in terms of simple ideas such as the decoration of crystal surfaces with 
atoms or molecules and periodic and aperiodic assembly, and by the 
application of familiar band structure models. In principle, any two- 
dimensional electron system (2DES) can be transformed into a host for 
Dirac fermions if it is patterned with a suitable periodic array of gates. In 
the case of top gates biased to a positive potential relative to a buried 
2DES, a honeycomb arrangement of gates or dots'* is necessary to 
induce electrons to form a graphene structure. The dual of this con- 
figuration can also be exploited: for negative potentials that deplete 
electrons under the gates, a triangular gate array leaves electrons in a 
honeycomb pattern and is described by the same Dirac Hamiltonian. 
Recently we showed how single atoms can function as atomic-size gates 
of a 2DES at noble metal surfaces’’ and how simple molecules such as 
CO function as repulsive potentials for surface electrons when shaped 
into open’ and closed’* quantum structures. Here we use individual CO 
molecules arranged molecule by molecule on a Cu(111) surface as a 
tunable gate array to transform a 2DES passing through these lattices 
into Dirac fermions, whence the term ‘molecular graphene’. The specific 
potentials established induce these quasi-particles to condense into vari- 
ous topological phases. The resulting nanostructures need not even be 
periodic. In fact, control over every lattice position and potential in the 
artificial materials we construct provides unprecedented control of the 
spatial texture of the hopping parameter, ultimately allowing obser- 
vation of electronic ordering into ground states rarely encountered in 
natural systems. (See Supplementary Fig. 1 for a summary.) 

The starting point of our experiment is the nearly free 2DES on 
Cu(111), which is characterized by very long coherence lengths 
(>1,000 Å near the Fermi energy, E_F, where the sample bias is 
V = E/e = 0 and e is the electron charge), a band edge at 
E_0 = −0.45 eV and an effective mass of m* = 0.38m_e, where m_e is 
the electron mass. For E < 0.5 eV, the dispersion relation is parabolic 
and the Fermi velocity is v_F = 6.45 × 10⁵ m s⁻¹. Carbon monoxide 
molecules are adsorbed on clean Cu(111) and a lattice is assembled 
by positioning the molecules individually using the tip of a scanning 
tunnelling microscope (Fig. 1a and Supplementary Video 1). All 
experiments were performed at a temperature of 4.2 K. By applying a 
periodic potential at the surface, we can embed a topological singularity 
into the energy bands, thus forming a Dirac point and producing linear 
carrier dispersion. In this work, we made molecular graphene consist- 
ing of ~100-1,000 molecules. For large lattices, the numbers of equi- 
valent C sites and C-C bonds are respectively almost two and three 
times the number of assembled molecules. 
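
The quoted Fermi velocity follows from the parabolic dispersion. As a minimal check, the sketch below assumes a free-electron-like band with its edge 0.45 eV below the Fermi energy and an effective mass of 0.38 electron masses, so that v_F = √(2 × 0.45 eV / m*). The function name is illustrative.

```python
import math

M_E = 9.1093837015e-31  # electron mass (kg)
EV = 1.602176634e-19    # one electronvolt (J)


def fermi_velocity(band_depth_ev, effective_mass_ratio):
    """Fermi velocity (m/s) of a parabolic band filled to band_depth_ev."""
    kinetic_energy = band_depth_ev * EV         # E_F minus band edge, in J
    effective_mass = effective_mass_ratio * M_E
    return math.sqrt(2 * kinetic_energy / effective_mass)


if __name__ == "__main__":
    v_f = fermi_velocity(0.45, 0.38)
    print(f"v_F = {v_f:.3e} m/s")
```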

Figure 1 | Dirac fermions in molecular graphene. a, Sequence of constant-current 
topographs during the assembly of a molecular graphene lattice 
(V = 10 mV, I = 1 nA). b, Topograph of a molecular graphene lattice 
composed of 149 CO molecules (lattice constant, d = 8.8 Å). c, Spatially 
averaged, normalized differential conductance spectrum, g(V) (solid line), 
measured on the top sites near the centre of quasi-neutral molecular graphene 
(d = 19.2 Å), accompanied by a tight-binding DOS fit (dashed line) with 
hopping parameters t = 90 meV and t′ = 16 meV. Inset, resulting Dirac cone 
realized in reciprocal space (corresponding to fit parameters). The 
tight-binding spectrum is calculated by finding energy eigenvalues of a finite 
graphene lattice with Lorentzian basis functions (to model the finite lifetime 
due to scattering to bulk states and coupling to the two-dimensional continuum 
at the graphene edges, we used an electron self-energy of Γ/2, where the 
linewidth is Γ = 40 meV from observed broadening of states near E_D). 
d, Linearly dispersing quasi-particles revealed by the conductance spectra 
g̃(Ẽ, r), plotted individually for sublattice A (filled circles: pseudospin s_z = +1/2, 
|↑⟩) and sublattice B (open circles: pseudospin s_z = −1/2, |↓⟩), measured at 
locations r illustrated in the inset. Points for |Ẽ| ≲ eV_rms, where V_rms is the 
modulation voltage, are excluded from this plot because this instrumental 
broadening prohibits their accurate measurement. 

A completed ‘flake’ of molecular graphene is shown in topographic 
form in Fig. 1b, demonstrating a perfect internal honeycomb lattice 
and discernible edge effects at the termination boundaries. The 
spectrum shown in Fig. 1c was measured at the lattice C sites near the 
centre of a lattice built using 271 CO molecules separated by a distance 
d = √3a = 19.23 Å. The spectra in all the figures show surface-state 
conductance, g̃(Ẽ, r), where r denotes the measurement position. 
(Henceforth, ‘tilde’ quantities refer to continuum properties of the 
Dirac fermions.) These spectra are measured by taking the ratio, g_R, 
between the measured differential tunnelling conductance and the 
spatially averaged value acquired on clean Cu(111) (Supplementary 
Fig. 2). This normalization removes the featureless slope present in the 
bare Cu spectrum and cancels the effect of possible energy-dependent 
tunnelling matrix elements that may vary between different microscope 
tips. The jump in differential conductance at the two-dimensional band 
edge, m*/πħ² = 1.585 eV⁻¹ nm⁻², additionally provides a 
quantitative calibration of the surface density of states (DOS) and is 
used to scale g_R to meaningful units (Supplementary Information). 
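
The density-of-states step used for this calibration can be reproduced from the effective mass alone. The sketch below assumes the standard two-dimensional free-electron result m*/πħ², which includes spin degeneracy, and converts it to states per eV per nm²; the function name is illustrative.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant (J s)
M_E = 9.1093837015e-31  # electron mass (kg)
EV = 1.602176634e-19    # one electronvolt (J)


def dos_2d_per_ev_nm2(effective_mass_ratio):
    """Two-dimensional density of states m*/(pi hbar^2) in eV^-1 nm^-2."""
    dos_si = effective_mass_ratio * M_E / (math.pi * HBAR ** 2)  # J^-1 m^-2
    return dos_si * EV * 1e-18  # convert J^-1 -> eV^-1 and m^-2 -> nm^-2


if __name__ == "__main__":
    print(f"DOS step = {dos_2d_per_ev_nm2(0.38):.3f} per eV per nm^2")
```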

The edge of the gap at the M point in momentum space (Fig. 1c) is 
marked by the peak in conductance at E_M = −104 meV. The Dirac 

Figure 2 | Dirac point engineering in a p-n-p junction. Spectroscopic 
measurements made from a p-n-p lattice with alternating lattice spacings: d 
changes abruptly from 17.8 to 20.4 Å and then back again. a, Topograph of the 
p-n-p lattice. The conductance spectra were measured across the centre line 
marked by the grey arrows. b, Intensity colour plot of the conductance spectra 
g(V, x), where x denotes the distance along the centre line. The white line is the 
locus of minima (the Dirac points (E_D)) in the conductance spectra. The dashed 
line marks the Fermi energy (E_F). Illustrative Dirac cones are superimposed to 
show the effective doping of each region. c, Spatially averaged, normalized 
conductance spectra measured along the centre line (marked by arrows in 
a). The first spectrum (blue, left) was measured in the left-hand, p-type, region 
(d = 17.8 Å), the second (orange, centre) was measured in the central, n-type, 
region (d = 20.4 Å) and the third (blue, right) was measured in the right-hand, 
p-type, region (d = 17.8 Å). 

point is marked by the conductance minimum at E_D = −5 meV. 
Although the 2DES is normally decoupled from the bulk electrons for 
pure Cu, the presence of the molecules allows finite scattering and hence 
adds to the lifetime broadening evident in g. In a direct comparison with 
graphene tight binding, the nearest-neighbour hopping energy scale can 
be read from the data in Fig. 1c as t ≈ E_M − E_D = 99 meV. The 
corresponding Dirac velocity is c̃ = 3ta/2ħ = (2.5 ± 0.2) × 10⁵ m s⁻¹. The 
distinctive graphene spectral signature observed (Fig. 1c) can be contrasted 
with the electronic structure of the honeycomb lattice inverse, which is a 
(non-Dirac) triangular electron lattice (Supplementary Fig. 3). The 
successful match to tight-binding theory in both cases demonstrates the 
promise of these techniques for realizing many other lattice types. 

A complementary model uses the nearly free electron model and 
‘muffin-tin’ potential’’ to extract the properties of the emergent Dirac 
fermions. This approach (Supplementary Information) yields the 
position of the Dirac point, centred at the K points of the supercell 
Brillouin zone. For a wide range of Fourier coefficients of the 
superlattice potential, the nearly free electron model predicts that 
t = 4√3πħ²/27m*a² = E_M − E_D = 118 meV and that c̃ = 2πħ/3√3m*a = 
(3.3 ± 0.2) × 10⁵ m s⁻¹, in good agreement with the parameters 
deduced from the tight-binding model. 
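
The two Dirac velocities quoted so far can be recomputed from the stated parameters. The sketch below takes a = d/√3 with d = 19.23 Å, uses the tight-binding relation c̃ = 3ta/2ħ with t = 99 meV, and the nearly free electron relation c̃ = 2πħ/(3√3 m* a) with m* = 0.38 m_e. The function names are illustrative, and only central values are used, without the experimental uncertainties.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant (J s)
M_E = 9.1093837015e-31  # electron mass (kg)
EV = 1.602176634e-19    # one electronvolt (J)

D_SPACING = 19.23e-10                # CO-CO distance d (m)
A_BOND = D_SPACING / math.sqrt(3)    # honeycomb bond length a = d / sqrt(3)


def dirac_velocity_tight_binding(t_ev, a=A_BOND):
    """c = 3 t a / (2 hbar) for nearest-neighbour hopping t (in eV)."""
    return 3 * t_ev * EV * a / (2 * HBAR)


def dirac_velocity_nearly_free(effective_mass_ratio, a=A_BOND):
    """c = 2 pi hbar / (3 sqrt(3) m* a) from the nearly free electron model."""
    return 2 * math.pi * HBAR / (3 * math.sqrt(3) * effective_mass_ratio * M_E * a)


if __name__ == "__main__":
    print(f"tight binding:        {dirac_velocity_tight_binding(0.099):.2e} m/s")
    print(f"nearly free electron: {dirac_velocity_nearly_free(0.38):.2e} m/s")
```

Both results fall within the uncertainties of the corresponding values quoted in the text.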

The pseudospin structure of the Dirac point can be probed by 
directly tunnelling into the A and B sublattice sites’’. Figure 1d shows 
plots of Ẽ = E − E_D versus g for the two sublattices, with a sign 
modification depending on which sublattice is probed. This method 
interrogates the underlying wavefunction overlap with each pseudospin s_z, 
and reveals the underlying massless Dirac fermion structure in the 
assembled nanomaterial. The slopes of the observed linearly vanishing 
DOS yield c̃ = (3.1 ± 0.2) × 10⁵ m s⁻¹. This and the above values for c̃ 
are in reasonable agreement and also match an expected theoretical 
value of c̃ ≈ v_F/2 ≈ 3.2 × 10⁵ m s⁻¹ for the nearly free electron 
model’’. Just as in real graphene’, an electron-hole asymmetry is 
observed in our molecular graphene (Fig. 1c). This asymmetry is 
due to a finite second-nearest-neighbour hopping-matrix element, 
t'. A full tight-binding fit to the experiment (Fig. 1c) yields 
t= 90 meV and t' = 16 meV. These values agree well with the poten- 
tial model (Supplementary Information). The resulting Dirac cone 
centred at each K point in reciprocal space is plotted in Fig. 1c (inset). 

Although E_F is fixed for the underlying 2DES and cannot be changed 
by conventional electrostatic gating, we are able to modify the lattice 
parameters to change the electron count per superlattice unit cell and 
thus control the graphene doping level. Making these changes on the 
atomic scale permits a change in the Dirac fermion carrier 
concentration over very short distances. To demonstrate this idea, we built 
molecular graphene with alternating CO lattice spacings, starting with 
d = 17.8 Å and abruptly changing to 20.4 Å and then back to 17.8 Å 
(Fig. 2). In the lattice with the smaller spacing, the Dirac point is located 
46 meV above E_F, leading to a hole-doped (p-type) lattice with a carrier 
density of 1.41 × 10¹² cm⁻² (Fig. 2a). The lattice with the larger spacing 
is electron doped (n-type), with the Dirac point located 49 meV below 
E_F and a carrier density of 1.23 × 10¹² cm⁻² (Fig. 2c). We measure the 


Figure 3 | Charge- and bond-density waves in molecular graphene. 

a, Topographic image of quasi-neutral molecular graphene (d = 19.2 A), with 
an additional CO molecule at the top site (location indicated in inset). 

b, Impurity scattering quasi-particle interference mapped through the 
subtraction of two topographs (V = 10 mV, I = 1 nA) measured in identical 
fields of view and distinguished only by the presence of the extra CO molecule 
at the top site. c, Topograph of quasi-neutral molecular graphene (d = 19.2 Å), 
with the central CO molecule removed from the empty site (location indicated 
in inset). d, Corresponding difference map of two topographs measured as 
above, revealing quasi-particle interference of higher symmetry. e, Left: 
schematic of a pure molecular graphene lattice (grey circles denote CO 
molecule positions, small black dots are C sites, blue lines represent uniform 
hopping parameter t). At bottom, the schematic is overlaid with a topograph of 
the corresponding experimental lattice (d = 17.8 Å). Right: schematic of the 
modifications to pure molecular graphene to obtain a Kekulé hopping texture. 
The addition of extra CO molecules (grey circles) splits the nearest-neighbour 
hopping parameter into two different values (t₁ in purple and t₂ in pink, t₁ > t₂), 
as illustrated. At bottom, the transformation schematic is overlaid with 
measured topography from experiment after molecular manipulation 

(d = 19.2 Å). f, Conductance spectra measured in pure molecular graphene 
(blue) and in the graphene lattice after the Kekulé texturing (red). The spectra 
display the opening of a gap, Δ, at the Dirac point. Dashed curve, tight-binding 
fit calculated with t₁ = 2t₂. The small difference in d compensates for the 
change in chemical potential introduced by the extra CO molecules. 
surface spectrum (g) across the lattice along the line indicated (Fig. 2a, b, 
arrows), crossing all regions. Because there are no charging effects, the 
interface between the p- and n-type regions is very narrow: it is about 
20 A wide (Fig. 2b). The extremely short transition between the p- and 
n-type regions makes this device a suitable candidate in which to study 
phenomena such as the Klein paradox or to create a Veselago lens. 

Using atomic manipulation, we created both pseudospin-conserving 
and pseudospin-breaking local disturbances further to reveal the 
Dirac nature of our system. We started with quasi-neutral molecular 
graphene with E_D near E_F (271 CO molecules, d = 19.2 Å) and tested 
two symmetries (a C-site defect (Fig. 3a), which locally imbalances the 
sublattices, and an empty-site vacancy (Fig. 3c), which locally preserves 
sublattice symmetry); these two structures are topologically distinct 
because their potentials correspond respectively to a local vector 
potential and a local scalar potential coupling to pseudospin. By sub- 
tracting two low-bias scanning tunnelling microscope topographs, one 
with the impurity and the other without it, and both locked to exactly the 
same area and using identical measurement parameters, we obtain 
detailed pseudospin maps resulting from the tiny DOS perturbations 
caused by quantum interference. The distinct patterns observed have the 
three-fold symmetry (Fig. 3b) predicted for single impurities in graphene 
that disrupt the Berry phase, rather than the full six-fold symmetry of 
a scalar perturbation, which conserves pseudospin (Fig. 3d). 

Topological changes fundamentally alter the lattice symmetry and 
are the key to unlocking physical phenomena such as electron fractionalization. 
In graphene, one of the simplest (yet unrealized) 
deformations is the Kekulé distortion rooted in the historical interpretation 
of benzene. This distortion breaks the bond symmetry of 
graphene by forming two hopping elements, t₁ and t₂, in the pattern 
shown in Fig. 3e. We produce this distortion using a special ‘Mercedes’ 
arrangement of CO molecules in the honeycomb empty sites. This has 
the effect of modulating the strength of every other C-C bond along 
the perimeter of each cell. Such a distortion adds an off-diagonal term 
to the Hamiltonian", such that 

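$$
H = \begin{pmatrix} \hbar\tilde{c}\,\boldsymbol{\sigma}\cdot\mathbf{k} & \Delta\,I_{2\times 2} \\ \Delta^{*}\,I_{2\times 2} & -\hbar\tilde{c}\,\boldsymbol{\sigma}\cdot\mathbf{k} \end{pmatrix}
$$

where I_{2×2} is the two-by-two identity matrix and an asterisk denotes 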
complex conjugate. This distortion is predicted to open an energy gap, 
A, even if the underlying sublattice symmetry is not broken; notably, this 
effect has never been observed. Figure 3f proves that the Kekulé distortion 
works as theoretically predicted, creating massive Dirac fermions 
out of the massless Dirac fermions in the pristine lattice. From fits to 
theory, the mass of the emergent fermions is m_D = 0.1 ± 0.02 m_e. The 
Kekulé ground state has an intriguing mapping to a superconducting 
topological surface state, after pseudospin is mapped to spin 
and the valley degree of freedom to an isospin, equivalent to attaching a 
scalar gauge field that produces a Dirac fermion mass. This scalar gauge 
field is manifest in the bond-density wave mosaic structure visible in 
Fig. 3e. The transition from massless to massive Dirac fermions has 
been theoretically cast as a quantum phase transition; the molecular 
graphene system provides an experimental test bed of these ideas start- 
ing with the spontaneous generation of mass observed here. 
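
A short worked step, offered as a sketch, shows why such an off-diagonal term gives the fermions a mass. It assumes that the Hamiltonian has the block form with ħc̃ σ·k and −ħc̃ σ·k on the diagonal and Δ I_{2×2}, Δ* I_{2×2} off the diagonal, and it uses the identity (σ·k)² = |k|² I_{2×2}. Squaring the Hamiltonian, the cross terms cancel because the off-diagonal blocks are proportional to the identity:

$$
H^{2} = \left(\hbar^{2}\tilde{c}^{2}|\mathbf{k}|^{2} + |\Delta|^{2}\right) I_{4\times 4}
\quad\Longrightarrow\quad
E_{\pm}(\mathbf{k}) = \pm\sqrt{\hbar^{2}\tilde{c}^{2}|\mathbf{k}|^{2} + |\Delta|^{2}}
$$

Read as a relativistic dispersion with c̃ in place of the speed of light, this corresponds to a rest mass m = |Δ|/c̃². Whether the gap read off from the spectra is to be identified with |Δ| or with the full splitting 2|Δ| is not settled by this sketch.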

The chiral character of the electronic charge in graphene is due to the 
pseudospin associated with the symmetry between the two triangular 
sublattices that form the honeycomb lattice. It has been proposed that 
by breaking this sublattice symmetry through strain, it is possible to 
generate a pseudomagnetic field and therefore obtain Landau levels 
and quantum Hall phases without breaking time reversal symmetry. 
The effect of strain has recently been observed in graphene nanobubbles, 
but tunable molecular graphene offers much more precise 
and in situ control over internal gauge fields. The strain field displacements 
in polar coordinates (r and θ) suggested to generate a constant 
field are (u_r, u_θ) = (qr² sin(3θ), qr² cos(3θ)), where q is a parameter denoting 
the strength of the strain. In our final experiment, we applied this 
strain field to molecular graphene by means of atomic manipulation. 
Topographs for successively strained graphene are shown in 
Fig. 4a. The value of the pseudomagnetic field can be estimated as 
B = 8βħq/(ea) = 16πqħ/(3de), where β = −∂ln(t)/∂ln(a) ≈ 2 (Supplementary 
Information). We study strain values up to q = 10⁻³ Å⁻¹, which is 


Figure 4 | Landau quantization and topological zero modes in a tunable 
pseudomagnetic field. a, Sequence of topographs of molecular graphene 
lattices with increasing values of triaxial strain. The position of each CO 
molecule was determined by the dislocation vector defined in the main text. 
From bottom to top, q = 0, 2.5 × 10⁻⁴, 5 × 10⁻⁴, 7.5 × 10⁻⁴ and 10⁻³ Å⁻¹. 
The corresponding values of the constant pseudomagnetic field are B=0, 15, 
30, 45 and 60 T (felt in opposite directions by the two graphene valleys; see 
d, top inset). b, Topograph at the centre of the lattice without strain distortion 
(q = 0 Å⁻¹), showing the unbroken symmetry between each sublattice 
(pseudospin) of the honeycomb. c, Topograph at the centre of the lattice with 
strain distortion (q = 10⁻³ Å⁻¹), showing the broken symmetry between each 
sublattice (one bright and one dark) as a result of the localization of the zero 
Landau level on half of the sample (bright sublattice). d, Left: normalized 
conductance spectra measured on sublattice A (brighter top sites in c and 
orange circles in inset schematic) for successive values of strain. The spectra 
were measured near the centres of the lattices shown in a. Right: spectra 
measured on sublattice B (darker top sites in c and blue circles in inset 
schematic) for the same successive values of strain, showing the opening of a 
Landau gap. Grey solid curves are tight-binding fits of strained finite lattices to 
the experiment. The grey dotted curve shows a tight-binding calculation in a 
real magnetic field, B = 60 T, for an unstrained lattice of the same dimensions 
and hopping parameters. 
equivalent to a pseudomagnetic field of B ≈ 60 T, corresponding now 
to a vector gauge field dressing the Dirac fermions (the applied triaxial 
strain can be contrasted with uniaxial strain (Supplementary Fig. 4), 
which is devoid of gauge fields). The breaking of the pseudospin sym- 
metry of the Dirac point can be probed by direct tunnelling into the 
individual sublattice sites. Once the strain is applied, the symmetry 
breaking between the two sublattices is discernible in the topographs 
(Fig. 4b, c) and can be noticed in the spectra (Fig. 4d). In the ‘bright’ 
sublattice (A), a well-defined zero-bias state—the zero Landau level— 
forms and gains prominence with increasing strain. In the ‘dark’ 
sublattice (B), the spectra are characterized by a reduction in the con- 
ductance at zero bias with strain—the formation of a Landau gap— 
visible through the transformation of typical V-shaped spectra into 
U-shaped contours. 
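
The triaxial strain pattern can be sketched numerically. The example below assumes the displacement field (u_r, u_θ) = (qr² sin 3θ, qr² cos 3θ), converts it to Cartesian components, and scales the pseudomagnetic field linearly with q, using the pair q = 10⁻³ Å⁻¹ and B ≈ 60 T quoted in the text as calibration. The function names and the calibration shortcut (used here instead of evaluating the prefactor) are assumptions of this example.

```python
import math

B_PER_Q = 60.0 / 1e-3  # tesla per (1/angstrom), calibrated from q = 1e-3 1/A -> ~60 T

def triaxial_displacement(x, y, q):
    """Cartesian displacement (angstrom) of a site at (x, y) for strain strength q (1/angstrom)."""
    r = math.hypot(x, y)
    theta = math.atan2(y, x)
    u_r = q * r**2 * math.sin(3 * theta)      # radial component
    u_theta = q * r**2 * math.cos(3 * theta)  # azimuthal component
    ux = u_r * math.cos(theta) - u_theta * math.sin(theta)
    uy = u_r * math.sin(theta) + u_theta * math.cos(theta)
    return ux, uy

def pseudomagnetic_field(q):
    """Pseudomagnetic field in tesla, assumed strictly linear in q."""
    return B_PER_Q * q

for q in (0.0, 2.5e-4, 5e-4, 7.5e-4, 1e-3):
    ux, uy = triaxial_displacement(10.0, 5.0, q)
    print(f"q = {q:.2e} 1/A: u = ({ux:.4f}, {uy:.4f}) A, B = {pseudomagnetic_field(q):.0f} T")
```

In Cartesian form the same field reduces to (u_x, u_y) = (2qxy, q(x² − y²)), which is a convenient check on the polar expressions.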

The presence of a zeroth Landau level at the Dirac point is one of the 
hallmarks of the quantum Hall effect with Dirac fermions. The low 
Fermi velocity here means that even a small strain can push the first 
excited Landau level beyond the linear regime in the band structure, 
towards or beyond the edge of the M-point gap. We estimate that for 
our lowest value of strain (q = 0.25 × 10⁻³ Å⁻¹), B ≈ 15 T and the 
n = 1 Landau level therefore lies at 33 mV. The registry of the CO 
molecules to the surface Cu atoms quantizes the allowed lattice sizes 
and hinders the formation of lattices with smaller strains. Nevertheless, 
a complete quantitative model of our finite system shows that it 
behaves as a graphene quantum dot in a real magnetic field, B, when 
these strain fields are applied (in Fig. 4d, tight-binding fits for finite 
strain and zero B and for zero strain and high B both provide excellent 
reproductions of the observations). 

Owing to the large strains applied, the particular topological state 
observed here is an ultraquantized ground state: only the zeroth 
Landau level is present and fractionally occupied by Dirac fermions. 
Such a state is unprecedented in normal graphene owing to the larger 
energy scales and, hence, very large magnetic fields necessary to study 
it. The pseudospin symmetry breaking observed is connected with the 
fundamental index theorem, which predicts the same topological 
phase for Dirac fermions on a lattice as is observed here. In this phase, 
the electrons in the zero mode form a peculiar condensate in which 
they become completely delocalized over the sample but remain 
alternately localized and antilocalized in the pseudospin projection 
(Supplementary Information). The image in Fig. 4c provides a snapshot 
of this intriguing phase in real space. 

The study of artificial lattices may lead to technological applications, 
but they also provide a new level of control over Dirac fermions and 
allow experimental access to a set of phenomena that has hitherto been 
considered only theoretically. The introduction of tunable interactions 
between the electrons could lead to the formation of spin liquids in 
graphene, and the addition of spin-orbit coupling could lead to a 
quantum spin Hall effect. The time-reversal-invariant quantum 
Hall state observed here already has a direct connection to the two-dimensional 
topological insulator. In future, such topological phases 
may be produced even more easily by extending our manipulation 
techniques to self-assembly methods, which naturally generate 
long-range periodic potentials on decorated surfaces. 
Matter structured on a length scale comparable to or smaller than 
the wavelength of light can exhibit unusual optical properties. 
Particularly promising components for such materials are metal 
nanostructures, where structural alterations provide a straightforward 
means of tailoring their surface plasmon resonances and hence their 
interaction with light. But the top-down fabrication of plasmonic 
materials with controlled optical responses in the visible spectral 
range remains challenging, because lithographic methods are limited 
in resolution and in their ability to generate genuinely three-dimensional 
architectures. Molecular self-assembly provides an 
alternative bottom-up fabrication route not restricted by these 
limitations, and DNA- and peptide-directed assembly have proved 
to be viable methods for the controlled arrangement of metal 
nanoparticles in complex and also chiral geometries. Here we 
show that DNA origami enables the high-yield production of 
plasmonic structures that contain nanoparticles arranged in nanometre-scale 
helices. We find, in agreement with theoretical predictions, 
that the structures in solution exhibit defined circular 
dichroism and optical rotatory dispersion effects at visible wavelengths 
that originate from the collective plasmon-plasmon interactions 
of the nanoparticles positioned with an accuracy better 
than two nanometres. Circular dichroism effects in the visible part 
of the spectrum have been achieved by exploiting the chiral 
morphology of organic molecules and the plasmonic properties of 
nanoparticles, or even without precise control over the spatial 
configuration of the nanoparticles. In contrast, the optical 
response of our nanoparticle assemblies is rationally designed and 
tunable in handedness, colour and intensity—in accordance with our 
theoretical model. 

To demonstrate the potential of DNA origami for the programmable 
and nanometre-precise design of plasmonic nanostructures, we targeted 
an optical response that emerges as a collective effect from spatially 
precisely arranged multiple nanoparticles in close proximity, and that 
requires genuinely three-dimensional structures. The response of 
helically arranged metallic nanocrystals to an electromagnetic field 
meets both criteria. Much like the circular dichroism (CD, the differ- 
ential absorption of left and right circularly polarized light) effect seen 
with ‘optically active’ chiral molecules such as DNA and proteins that 
exhibit CD in the ultraviolet and infrared ranges owing to the electronic 
and vibronic excitations of their chiral secondary structure, 
strong CD signals in the visible spectrum have been predicted to occur 
through collective Coulomb interaction of plasmonic dipoles in chiral 
assemblies of metal nanoparticles. (See Supplementary Information 
notes 1 and 2 for information on naturally occurring CD, and on the 
theoretical framework of plasmon-induced CD.) 

To create helical nanoparticle assemblies, we used the DNA 
origami approach where hundreds of rationally designed ‘staple’ 
oligonucleotides are hybridized to a long single-stranded DNA scaffold 
to force it into a specific two- or three-dimensional shape. The resulting 
objects are fully addressable by their DNA sequence, enabling 
further decoration and functionalization in a unique, sequence-specific 
manner. Geometrical and material parameters can be tailored by 
simple adjustments of the design and fabrication protocols, which 
should make it feasible to quantitatively tune the optical activity of 
such nanostructures dispersed in solution. 

Figure 1a illustrates the nanostructure design (design details and 
DNA sequences are given in Supplementary Information note 3). It is 
based on DNA origami 24-helix bundles that offer nine helically 
arranged attachment sites for plasmonic particles coated with single- 
stranded DNA, here gold nanoparticles with a diameter of 10 nm. By 
using nanoparticles covered with multiple linker strands, we can avoid 
yield-lowering purification procedures and instead add an excess of 
modified gold particles for hybridization to the DNA structures; 
unconjugated gold particles are simply removed for re-use in later 
experiments. The quality of the assemblies was assessed by trans- 
mission electron microscopy (TEM) (Fig. 1b) (experimental details 
and additional TEM images can be found in Supplementary 
Information note 4). The strength of the optical activity critically 
depends on the quality of the assembled structures, so we improved 
our experimental protocol to achieve a nanoparticle attachment yield 
of 96% and 98% per site and an overall yield of 77% and 86% of perfect 
assemblies of right- and left-handed helices with nine nanoparticles, 
respectively (Supplementary Information note 5). Most of the 
imperfect structures exhibited only a single defect within an otherwise 
well-formed helix. Helices stored for three months at 4°C were still 
intact (Supplementary Information note 6). As illustrated in Fig. 1c, 
CD measurements were then carried out on solutions containing 
left- or right-handed nanohelices. 

Left-handed helical arrangements of plasmonic particles are 
expected to produce a CD signal with a characteristic bisignate peak- 
dip shape, with the peak centred around the plasmon resonance fre- 
quency. Right-handed helices should produce a vertically mirrored 
dip-peak signal. Indeed, CD spectra measured on samples with gold 
nanoparticle assemblies of both helicities exhibited the anticipated 
signatures (Fig. 2a). These responses arise because the helical arrange- 
ment of the gold nanoparticles results in coupled plasmon waves 
propagating along a helical path and causing increased absorption of 
those components of the incident light that are in accord with the 
handedness of the helices. The bisignate appearance of the CD signals 
results from the isotropic nature of our samples, in which the 
nanohelices are randomly oriented with respect to the direction of 
the incident light. The interactions between the individual gold particles 
within each helix create a splitting between the longitudinal and trans- 
verse modes of the electromagnetic wave, and these modes typically have 
opposite chirality. As a result, the plasmonic CD spectrum acquires the 
characteristic dip-peak shape (Supplementary Information note 2). 
Figure 1 | Assembly of DNA origami gold nanoparticle helices and 
principle of circular dichroism. a, Left- and right-handed nanohelices 
(diameter 34 nm, helical pitch 57 nm) are formed by nine gold nanoparticles 
each of diameter 10 nm that are attached to the surface of DNA origami 24-helix 
bundles (each of diameter 16 nm). Each attachment site consists of three 
15-nucleotide-long single-stranded extensions of staple oligonucleotides. Gold 
nanoparticles carry multiple thiol-modified DNA strands, which are 
complementary to these staple extensions. Nanoparticles and 24-helix bundles 
are mixed for assembly and the resulting constructs are gel-purified. b, TEM 
image of assembled left-handed gold nanohelices (scale bar, 100 nm). Analysis 
of the TEM data yields a 98% success rate for directed attachment of 
nanoparticles. c, Circular dichroism is measured as the difference in absorbance 
ΔA = A_LCP − A_RCP of left-hand-circularly polarized (LCP) and right-hand-circularly 
polarized (RCP) light as a function of wavelength. CD measurements 
were performed with a CD spectrometer on samples in cuvettes of optical path 
length 3 mm. 

Figure 2 | Circular dichroism of self-assembled gold nanohelices. 
Experimental (a and c) and theoretical (b and d) CD spectra of left-handed (red 
lines) and right-handed (blue lines) helices of nine gold nanoparticles show 
characteristic bisignate signatures in the visible. a, CD spectra of nanohelices 
composed of 10-nm gold particles. The peak position in the spectra of left-handed 
helices (indicated by the arrow) coincides with the plasmon absorption 
resonance maximum. b, The theoretically predicted CDs for the geometries in 
a exhibit the same features; the peak positions and amplitudes are in remarkable 
agreement with the experiment. c, The CD signal increases owing to collective 
plasmonic enhancement by a factor of 400 for assemblies of nanoparticles with 
16-nm diameter, rendering the noise in the spectra invisible (as in a). The peak 
position for left-handed helices exhibits a red-shift from 524 nm to 545 nm. 
d, The corresponding theoretical calculation predicts a 500-fold enhancement 
of the signal and a peak shift from 523 nm to 534 nm. The CD spectra were 
recorded at concentrations of nanohelices of 1.5 nM in a and 0.4 nM in c. The 
insets in a and c show TEM images of left-handed (red frame) and right-handed 
(blue frame) nanohelices (scale bars, 20 nm); the respective left-handed model 
geometries are depicted in the insets to b and d. 


The experimental spectra agree well with theoretical calculations 
based on classical electrodynamics (Fig. 2b). The strongest absorption 
of circularly polarized light is predicted in the vicinity of the surface 
plasmon frequency of the metal nanoparticles, as experimentally seen 
with our assemblies. Strikingly, even the strength of the measured 
signals closely matches the magnitude predicted by the calculations. 
We further confirmed theoretically that the CD signal remains robust 
to positional perturbations of the individual particles (see Supplemen- 
tary Information note 2). 

The dipole theory predicts that the CD signal becomes stronger 
when the particles are either larger or arranged in a tighter helix: 

$$
\mathrm{CD}_{\mathrm{plasmon}} \propto \frac{a_{\mathrm{NP}}^{12}}{R_{0}^{8}} \qquad (1)
$$

where a_NP and R_0 are the nanoparticle and helix radii, respectively. We 
quantified the influence of particle size on the CD signal by using 
electroless deposition of metal from solution onto 10-nm seeding 
particles that were already assembled into the helical geometry 
described in Fig. 1 (see Supplementary Information note 7 for experi- 
mental details and additional TEM images). CD measurements of 
these ‘enhanced’ samples showed two notable effects (Fig. 2c): the 
signal strength increased up to 400-fold for nanoparticle diameters 
of 16 ± 2 nm, and the absorption as well as the CD peak shifted to 
longer wavelengths. These results are consistent with theoretical pre- 
dictions for the plasmonic CD effect. A simple estimate for the CD 
signal increase based on equation (1) and assuming a particle size of 
16 nm yields an enhancement of about 280, and quantitative numerical 
calculations predict an enhancement of about 500. As in the experi- 
ments, the calculated CD for larger particles and the same helix radius 
is also red-shifted (Fig. 2d), which is typical for strongly interacting 
plasmonic nanocrystals. Taken together, these observations illus- 
trate that desired optical behaviour can be designed and realized by 
exploiting the collective interactions between plasmonic particles 
attached with close to 100% yield and positioned with nanometre-scale 
precision. 
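
The simple estimate quoted above can be checked in a few lines. The sketch assumes that, at fixed helix radius, the CD signal scales with the twelfth power of the particle size; the exponent is treated as an assumption of this example, and it reproduces the quoted enhancement of about 280. The function name is hypothetical.

```python
def cd_enhancement(d_new_nm, d_old_nm, exponent=12):
    """Relative CD increase when the particle diameter grows at fixed helix radius."""
    return (d_new_nm / d_old_nm) ** exponent

estimate = cd_enhancement(16.0, 10.0)
print(f"estimated enhancement: {estimate:.0f}")  # about 281
```

Because of the steep power law, the quoted size spread of 16 ± 2 nm corresponds to a range from roughly 57 (14 nm) to roughly 1,160 (18 nm) in the same estimate, so agreement with the measured factor of up to 400 at this level is reasonable.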

To generate CD at other wavelengths, we plated silver onto pre- 
assembled 10-nm gold particle helices and thereby produced a silver 
shell of about 3 nm around each of the gold nanoparticles. Plasmon 
resonances in silver occur at a shorter wavelength than in gold, so the 
recorded CD spectra were shifted into the blue spectral region (Fig. 3a). 
To fine-tune the optical response to intermediate wavelengths, we used 
mixtures of gold and silver ions in the plating solution to grow alloy 
shells around the seed particles. Figure 3a presents a TEM image 
(upper inset) with the corresponding CD spectrum of an alloy-coated 
structure. As expected, the CD signal for the alloy helices appears at an 
intermediate wavelength (absorption spectra and additional CD spectra 
are presented in Supplementary Information note 8). Mixing of solu- 
tions containing different types of structures resulted in a linear super- 
position of the corresponding spectra (Fig. 3a). These experiments 
indicate the potential of our assembly method for the fabrication of 
optically active materials with customized spectral response. 
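
The linear superposition of spectra can be illustrated with a short sketch. It assumes that the CD of a mixture is the mixing-ratio-weighted average of the component spectra sampled at the same wavelengths; the function name and the numerical values are hypothetical placeholders, not measured data.

```python
def mix_spectra(spectrum_a, spectrum_b, parts_a, parts_b):
    """Weighted average of two spectra sampled on the same wavelength grid."""
    if len(spectrum_a) != len(spectrum_b):
        raise ValueError("spectra must share one wavelength grid")
    total = parts_a + parts_b
    return [(parts_a * a + parts_b * b) / total
            for a, b in zip(spectrum_a, spectrum_b)]

# Hypothetical CD values (arbitrary units) at three wavelengths
silver_shell = [5.0, -10.0, 0.0]
alloy_shell = [0.0, 5.0, -5.0]
mixture = mix_spectra(silver_shell, alloy_shell, parts_a=2, parts_b=3)
print(mixture)  # [2.0, -1.0, -3.0]
```

Comparing such a predicted curve with the measured spectrum of the mixture is one way to test whether the helices in the two populations contribute independently.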

To visualize the collective optical activity of our self-assembled 
nanohelices in a macroscopic optical experiment, we exposed gold 
nanoparticle helices to excessive amounts of silver ions during electro- 
less metal deposition and thereby achieved a giant molar CD of around 
10⁸ M⁻¹ cm⁻¹. The corresponding spectra are presented in Fig. 3b. 
Droplets of samples containing left-handed or right-handed helices 
and a non-helical control droplet were placed onto a glass slide between 
two linear polarizers. As demonstrated in Fig. 4 with polarization- 
resolved transmission images, the droplets with gold helices rotate 
the polarization angle of the linearly polarized light. Although our 
set-up is insensitive to polarization below 550 nm, rotation of the 
polarization axis of the linearly polarized light was detected for both 
right- and left-handed helices in the red band of the visible spectrum. 
The observation is in accordance with the effect of optical rotatory 
dispersion, which is characteristic of chiral materials exhibiting CD. 
In the case of our right-handed “metafluid”, the polarization axis is 
rotated clockwise (or counter-clockwise) at wavelengths above (or 
below) 560 nm; the opposite effect is seen with the left-handed 
nanohelices. The control sample containing isotropically dispersed 
10-nm gold nanoparticles exhibited no such optical activity. Further 
details on the materials characteristics (anisotropy factor and specific 
rotation) can be found in Supplementary Information notes 8 and 9. 
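
For orientation, a molar CD follows from a measured absorbance difference through the Beer-Lambert relation Δε = ΔA/(c·l). The sketch below uses the 40 pM concentration and the 3 mm path length stated in the figure captions; the ΔA value is a hypothetical input chosen only to illustrate the order of magnitude, not a measured number, and the function name is likewise an assumption.

```python
def molar_cd(delta_absorbance, concentration_molar, path_length_cm):
    """Molar circular dichroism in M^-1 cm^-1 from the Beer-Lambert relation."""
    return delta_absorbance / (concentration_molar * path_length_cm)

delta_a = 1.2e-3          # hypothetical absorbance difference A_LCP - A_RCP
concentration = 40e-12    # 40 pM of nanohelices
path_length = 0.3         # 3 mm cuvette, expressed in cm
print(f"{molar_cd(delta_a, concentration, path_length):.1e} M^-1 cm^-1")
```

At picomolar concentrations, an absorbance difference of order 10⁻³ therefore already corresponds to a very large molar CD.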

In future experiments we will explore the possibility of realizing 
materials with a negative refractive index without the requirement of 
Figure 3 | Spectral tuning of circular dichroism by metal composition. 

a, A pronounced blueshift of the peak position in the CD response was achieved 
by depositing several nanometres of silver on gold nanoparticles of diameter 
10 nm (blue spectrum of left-handed gold-core/silver-shell nanohelices; red 
spectrum from Fig. 2c is shown for reference). Growth of silver+ gold alloy 
shells gives rise to an intermediate blueshift (green spectrum of left-handed 
gold-core/silver+ gold-shell helices). Mixing of solutions that contain 
nanohelices of various metal compositions results in a superposition of 
individual CD responses (black spectrum): a 2:3 mixture of gold-core/silver- 
shell (blue spectrum) and gold-core/silver+ gold-shell (green spectrum) 
exhibits CD close to the predicted superposition (dashed black line; black 
curves were rescaled by a factor of 1.7 for presentation purposes). b, Excessive 
metal deposition leads to the merging of nanoparticles and further 
enhancement of the CD signal for the left-handed (orange spectrum) and right- 
handed (purple spectrum) gold-core/silver-shell helices. The corresponding 
molar CD in the visible spectrum is huge with a strength of about 
10⁸ M⁻¹ cm⁻¹. The CD spectra were recorded at concentrations of nanohelices 
of 0.4 nM in a and 40 pM in b. The insets in a and b show TEM images of left-handed 
(blue, green and orange frames) and right-handed (purple frame) 
nanohelices (scale bars, 20 nm). 
Figure 4 | Optical rotatory dispersion of self-assembled gold nanohelices. 
a, The macroscopic optical activity of the metafluid is demonstrated in 
polarization-resolved transmission by placing droplets of left-handed and 
right-handed nanohelices as well as a control droplet with isotropically 
dispersed gold nanoparticles between two linear polarizers. The components of 
the transmission set-up are ordered along the propagation path of white light as 
follows: linear polarizer, sample, linear analyser, screen (charge-coupled-device 
camera). The extinction characteristics of the linear polarizers render the set-up 
polarization-insensitive for wavelengths below 550 nm. b, Real-colour 
transmission images (scale bar, 1 mm) of the droplets for clockwise (top) and 
counter-clockwise (bottom) rotation of the analyser by one degree out of the 
orthogonal configuration. The droplets of right-handed and left-handed helices 
appear red in the top and bottom images, respectively, as a result of optical 
rotatory dispersion. Right- (or left-) handed nanohelices rotate the polarization 
axis of the transmitted light in positive (or negative) and negative (or positive) 
directions in the red and blue spectral bands, respectively. The rotation of the 
polarization in the red band is sensitively detected by the set-up, but green and 
blue components of white light illumination (below 550 nm) are not 
polarization-resolved and contribute only to the bluish background. 


either ε or μ being negative, using fluids with oriented or densely 
packed plasmonic helices, which should lead to a further increase of 
the chiral optical response. But irrespective of whether this goal can be 
realized, the present findings clearly establish DNA origami as a valuable 
addition to the existing tools at the material engineer’s disposal for 
precisely arranging nanoparticles into assemblies with desired electric 
or magnetic properties. 
Asymmetric spiroacetalization catalysed by confined Brønsted acids 

Acetals are molecular substructures that contain two oxygen-carbon 
single bonds at the same carbon atom, and are used in cells 
to construct carbohydrates and numerous other molecules. A distinctive 
subgroup are spiroacetals, acetals joining two rings, which 
occur in a broad range of biologically active compounds, including 
small insect pheromones and more complex macrocycles. Despite 
numerous methods for the catalytic asymmetric formation of other 
commonly occurring stereocentres, there are few approaches that 
exclusively target the chiral acetal centre and none for spiroacetals. 
Here we report the design and synthesis of confined Brønsted acids 
based on a C₂-symmetric imidodiphosphoric acid motif, enabling a 
catalytic enantioselective spiroacetalization reaction. These rationally 
constructed Brønsted acids possess an extremely sterically 
demanding chiral microenvironment, with a single catalytically rel- 
evant and geometrically constrained bifunctional active site. Our 
catalyst design is expected to be of broad utility in catalytic asym- 
metric reactions involving small and structurally or functionally 
unbiased substrates. 

Spiroacetal natural products are widely found in insects, plants, and 
bacterial and marine sources, and display a diverse set of biological 
activities. Their spiroacetal subunit is not only essential for the bioactivity 
but is also a privileged pharmacophore in drug discovery. 
Controlling the relative and absolute configuration of spiroacetals 
can be extremely important, and syntheses of natural products displaying 
the thermodynamically unfavoured spiroacetal configuration 
represent daunting challenges. Furthermore, natural products that 
contain a spiroacetal as the only source of chirality are well known. 
Indeed, even the parent 6,6-spiroacetal (1a in Fig. 1a) is a natural 
product, olean, the major female-produced sex pheromone of the olive 
fruit fly (Bactrocera oleae). Although isolated as a racemate from natural 
sources, olean’s two mirror images display the remarkable property that 
one isomer, (R)-1a, is active on males, and the other one, (S)-1a, on 
females. Asymmetric syntheses of olean have been published but 
invariably rely on chiral starting materials or reagents. It has been 
shown that enzymes are capable of stereospecific spiroacetal formation, 
and organic chemists have developed several powerful synthetic 
methods for the stereoselective formation of suitably substituted 
spiroacetals. However, so far direct access to spiroacetal motifs 
in a catalytic enantioselective manner has not been achieved. 

Encouraged by our studies of catalytic asymmetric acetalizations,
we targeted chiral Brønsted acid catalysed spiroacetalizations of readily
available hydroxyenol ethers, such as 2a (Fig. 1b, Supplementary Fig. 1).
Although the envisaged asymmetric hydroetherification of enol ethers
is unknown with chiral Brønsted acid catalysts, achiral acids are
routinely used to perform the non-asymmetric reaction. However, an
extensive screening of a wide range of known chiral Brønsted acids
gave only disappointing results in the generation of both the 5,5- and
the 6,6-spiroacetal (the best enantiomeric ratio was 29.5:70.5;
Supplementary Tables 1 and 2).

We hypothesized that the absence of a compact chiral environment
for the small oxocarbenium ion intermediate, which lacks specific and
sterically well defined interactions with its chiral anion, resulted in
poor enantiofacial discrimination. Whereas large active sites can
accommodate various transition state geometries leading to different
isomers, a confined space could limit this freedom and thereby
increase selectivity. On the basis of this hypothesis, we initiated
attempts towards modelling sterically demanding acid catalysts for
the envisaged spiroacetalization reaction. We realized that a solution
to the spiroacetalization problem could potentially also tackle an
important issue of broader significance in current Brønsted acid
catalysis. Although this field has acquired wide popularity and
importance in recent years, reactions of small aliphatic substrates that
do not possess sterically demanding protecting groups, large
aromatic/planar surfaces, or bulky substituents are still extremely
rare. We reasoned that this is due to the inability of current synthetic
Brønsted acid catalysts to provide a truly compact chiral
microenvironment. We speculated that such an environment could
effectively restrict molecular motion, leading to efficient
enantiodiscrimination even of substrates or intermediates that lack
spatially defined interactions, such as hydrogen bonding with the
catalyst (Fig. 1c). Here we present the design of simple and readily
available synthetic Brønsted acid catalysts featuring a sterically highly
demanding and rigid chiral microenvironment around their active site.
The design is inspired by enzymes that frequently display deep active
site pockets constructed by the folding of the protein, and also by the
architectures provided by artificial catalytic cavities.

Figure 1 | Towards catalytic asymmetric synthesis of olean. a, The two
enantiomers of olean from olive fruitfly. b, Proposed catalytic asymmetric
spiroacetalization. c, Catalyst design; see text for details. e.r., enantiomeric ratio.

Our design was based on the analysis of highly successful chiral
phosphoric acids, in particular those based on the BINOL
(1,1'-binaphthalene-2,2'-diol) backbone, their derivatives, N-triflyl
phosphoramides, and the more recently developed N-phosphinyl
phosphoramides and C,-symmetric imidodiphosphoric acids (Fig. 2a).
The active site of a BINOL-derived phosphoric acid is composed of
one Brønsted acidic (-OH) and one Brønsted basic site (=O). The
remarkable success of phosphoric acids in asymmetric catalysis is
widely attributed to bifunctional activation, in which both of these sites
are involved in the stabilization of a transition state (Fig. 2b).
Although bulky phosphates 3 (Fig. 2a) have found wide application
in asymmetric catalysis, it is challenging to further modify their steric
environment because 3,3'-substituents on BINOL radiate away from
the active site. Recently, significant synthetic efforts have been
undertaken by a number of groups to design alternative backbones that
would narrow the chiral environment of the phosphoric acid.
C,-Symmetric anions 4 and particularly 5 (Fig. 2a) seem to provide a
simple alternative way of introducing additional steric demand close to
the active site. However, both of these possess several different
Brønsted basic sites, potentially giving a large number of geometrically
distinct acid/base pairs compared to only one such pair present in the
phosphoric acid (Fig. 2a, b). This effectively leads to an ensemble of
catalytically active species, which can stabilize different transition
states, resulting in a low selectivity. Furthermore, rotational freedom
around P-N and N-S bonds renders flexible the relative positioning of
acidic and basic sites, and therefore the entire chiral environment.
Indeed, although anions 4 and 5 display remarkable efficiency in a
specific set of reactions, both of these structures lack the generality of
phosphoric acids. We reasoned that only one distinct and geometrically
fixed acid/base pair, translating into a single type of basic site in the
corresponding anion, is a prerequisite for the construction of a
successful and broadly applicable Brønsted acid catalyst.

Figure 2 | Development of C2-symmetric imidodiphosphoric acids.
a, Phosphate anion and its derivatives (catalytic moiety shown in blue),
showing numbering of different Brønsted basic sites (one in green and
additional ones in red). b, Bifunctional activation of reactants with Brønsted
acid/base pair in phosphoric acid catalysis. c, General imidodiphosphate
anion (showing O,O-syn and O,O-anti isomers). d, Conformationally locked
O,O-syn-imidodiphosphate anion with C2-symmetry (bulky 3,3'-substituents
responsible for locking shown in green). e, Bifunctional activation of
reactants with Brønsted acid/base pair in imidodiphosphoric acid catalysis.


316 | NATURE | VOL 483 | 15 MARCH 2012


On the basis of these considerations, we designed a novel class of
Brønsted acids, using a C2-symmetric imidodiphosphate anion
(Fig. 2c). In principle, such a C2-symmetric imidodiphosphate moiety
has two distinct Brønsted basic sites, O and N. The corresponding acid
should have a flexible relative positioning of acid/basic pairs owing to
free P-N rotation, and would not satisfy the above-mentioned criteria
for the design of a general chiral Brønsted acid. However, our catalyst
design aimed at restricting the imidodiphosphate moiety to a single
O,O-syn conformation between two identical BINOL subunits with
bulky 3,3'-substituents (Fig. 2d). We hypothesized that the inclusion
of two BINOL subunits would result in their interlocking due to
sterically demanding 3,3'-substituents (Fig. 2d). As a direct
consequence, the BINOL subunits are unable to freely rotate and the
resulting molecular structure possesses a very high rigidity.
Importantly, such an arrangement also resulted in the steric blocking of
the undesirable alternative Brønsted basic N-site. As the two BINOL
subunits are identical, anion 6 (Fig. 2d) is C2-symmetric, and has
therefore only a single type of catalytically relevant Brønsted basic
site. Consequently, the corresponding Brønsted acid possesses a single
catalytically active bifunctional acid/base pair with a fixed geometry
(Fig. 2e). The interlocking of BINOL subunits could in principle also
result in the conformational locking of the imidodiphosphate moiety in
the O,O-anti conformation to give the O,O-anti-isomer of 6 (Fig. 2c).
However, on the basis of our modelling studies we expected that the
formation of the corresponding O,O-anti-isomer will be disfavoured for
steric reasons due to the bulky 3,3'-substituents on the backbone.

Following this design, we have synthesized several imidodiphosphoric
acids based on anion 6, including 6a with 2,4,6-Et3C6H2 substituents
on the 3,3'-positions of the two BINOL backbones (Supplementary
Table 3). It is noteworthy that imidodiphosphoric acids such as 6a
are easily accessible. Their synthesis requires only a single additional
synthetic step compared to the corresponding phosphoric acids
(Supplementary Figs 2 and 3). Indeed, the crystal structure of
imidodiphosphoric acid 6a reveals a confined active site deeply buried
within a highly sterically demanding chiral environment (Fig. 3a,
Supplementary Fig. 4). Viewing the catalytic moiety from the top
illustrates how the two BINOL-substituents surround the active site 
from above (Fig. 3b). Viewed from the bottom, the catalyst demon- 
strates a tight arrangement of the two remaining BINOL subunits, 
which completely block access to the imidodiphosphate (Fig. 3c). 
For comparison, the crystal structure of one of the most popular and
sterically demanding phosphoric acids, TRIP (3a, that is, protonated 3
with R = 2,4,6-iPr3C6H2), is given in Fig. 3d and 3e. The active site of
TRIP is placed inside a broad open cone, whereas that of the imidodi- 
phosphate can better be described as a chiral pocket. Furthermore, the 
geometrical parameters of the imidodiphosphoric acid, which co-crys- 
tallized with a molecule of water, are consistent with a hydronium ion 
bridging the two imidodiphosphate oxygen atoms (Supplementary Fig. 
4, Supplementary Tables 4-6). This supports proton location on oxy- 
gen rather than on nitrogen, as shown in Fig. 2e. 

Having produced a Brønsted acid confined within an extremely
sterically demanding chiral cavity, we turned to the spiroacetalization
problem (Supplementary Table 3). Our catalyst design was quickly
rewarded with the first (to our knowledge) highly enantioselective
catalytic spiroacetalization reaction, obtaining (S)-olean from alcohol
2a with an excellent enantiomeric ratio of 98:2 (Table 1, entry 1). The
(R)-enantiomer of olean was easily obtained by using the enantiomer
of the catalyst (entry 2). Imidodiphosphoric acid catalyst 6a proved
quite general, and various other spiroacetals were obtained with high
enantioselectivity. Only 0.1 mol% of catalyst 6a was required in the
construction of the small 5,5-spiroacetal (entry 3). The 5,6-spiroacetal
motif could be accessed equally successfully by cyclization of either
five- or six-membered rings (entries 4 and 5). Different enol ether ring
sizes are similarly tolerated, as demonstrated by highly enantioselective
reactions to form the corresponding 7,6- and 7,5-spiroacetals (entries
6 and 7). Notably, all these small spiroacetals are core structures of
many natural products.
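
The enantiomeric ratios quoted above can be restated as enantiomeric excess, which some readers find easier to compare across entries. The following is a minimal sketch, assuming the usual definition ee = |major - minor| / (major + minor); the function name `er_to_ee` and the dictionary `table_er` are illustrative names, and the ratios are those reported for entries 1 to 7 of Table 1.

```python
def er_to_ee(major: float, minor: float) -> float:
    """Return the enantiomeric excess (in %) for an enantiomeric ratio major:minor."""
    total = major + minor
    if total <= 0:
        raise ValueError("ratio components must sum to a positive number")
    return 100.0 * abs(major - minor) / total


# Enantiomeric ratios reported for Table 1, entries 1-7.
table_er = {
    1: (98.0, 2.0),
    2: (97.5, 2.5),
    3: (96.0, 4.0),
    4: (95.5, 4.5),
    5: (96.0, 4.0),
    6: (96.0, 4.0),
    7: (98.5, 1.5),
}

ee_by_entry = {entry: er_to_ee(*ratio) for entry, ratio in table_er.items()}

for entry, ee in ee_by_entry.items():
    print(f"entry {entry}: {ee:.1f}% ee")
```

On the same scale, the best ratio from the initial screening of known acids (29.5:70.5) corresponds to `er_to_ee(70.5, 29.5)`, which gives a sense of how large the improvement is.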

When other stereocentres are present in the target molecule, accessing
the spiroacetal stereocentre in the most stable thermodynamic
configuration is usually easy. However, the formation of fragile
non-thermodynamic spiroacetals, having the less stable configuration of
the spiroacetal stereocentre, presents a formidable challenge for organic
synthesis. An ideal solution would be provided by a catalyst that
could override thermodynamic preference and control the spiroacetal
configuration. Gratifyingly, with our confined Brønsted acid catalyst this
goal could be achieved. A variety of non-thermodynamic spiroacetals
(1g, 1i, 1k in Table 1) could be formed with diastereomeric ratios ranging
from 5:1 to 23:1, against a strong thermodynamic preference of up to
1:124 (entries 8, 10 and 12 in Table 1). Moreover, when the catalyst
control matches the inherent thermodynamic preference, spiroacetals
can be obtained with excellent diastereoselectivities that can even
exceed thermodynamic ratios (entries 9, 11, 13). In this way,
thermodynamic spiroacetals can be obtained under very mild, kinetic
conditions, as compared to the relatively strongly acidic conditions
typically required for thermodynamic equilibration. It is important to
note that kinetic spiroacetalizations of substrates 2g-2m (Table 1),
catalysed with a simple achiral acid, favoured the thermodynamic
product with only poor diastereoselectivity ranging from 1:1 to 1:3.

Spiroacetalization of substituted enol ether 2m revealed another
powerful capability of the confined Brønsted acid 6a. An efficient
kinetic resolution occurred, delivering both bisacetal 1n and enolacetal
(R)-2m with excellent enantioselectivity (entry 14). Asymmetric
synthesis of these sensitive motifs by any other method would present
a considerable challenge. Finally, to further explore the behaviour of
our catalyst, we also examined large substrate 2o, an open form of the
steroidal sapogenin, diosgenin (entry 15). As expected, the reactivity
was rather low using either catalyst enantiomer, although only a single
diastereomer (the known non-thermodynamic isomer 20-epi-diosgenin)
was obtained. This probably results from the inability of
our confined acid to efficiently accommodate such large substrates.

Whereas recent developments in Brønsted acid catalysis have enabled
an array of transformations to be performed asymmetrically, substrates
have typically been sterically or electronically biased. We have
shown here that confined Brønsted acids with extreme steric demand
and chiral pockets reminiscent of those found in enzymes can
overcome this limitation and solve an important problem in organic
synthesis. Our spiroacetalization reaction offers a straightforward
catalytic enantioselective access to spiroacetal motifs, and is expected
to find applications in the synthesis of diverse natural products and
biologically active molecules. We believe that the concepts described
here open the door to the development of asymmetric reactions that
include small aliphatic and/or loosely bound molecules, and will be
widely applicable, possibly even beyond Brønsted acid catalysis.

Figure 3 | Comparison of crystal structures of anions of 6a (6 with
R = 2,4,6-Et3C6H2) and TRIP. a, Crystal structure of the anion of 6a, side view
(main figure) and schematic representation (inset). b, 6a, top view at the active
site, space filling model. c, 6a, bottom view at the active site, space filling model.
d, Crystal structure of TRIP anion (main figure) and schematic representation
(inset). e, TRIP, top view at the active site, space filling model. Colour coding:
dark grey, carbon; light grey, hydrogen; red, oxygen; orange, phosphorus;
purple, nitrogen.
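
To give a feel for the energy scales behind the selectivities discussed in the text, a product ratio can be converted into a free-energy difference using ΔG = RT ln(ratio). The sketch below is illustrative only. It assumes that the kinetic ratio reflects a difference in competing transition-state energies and that the thermodynamic ratio reflects an equilibrium constant, and it takes "room temperature" to mean 25 °C, a value not stated in the source. The function name `free_energy_gap_kj` is hypothetical.

```python
import math

R_GAS = 8.314462618  # molar gas constant, J mol^-1 K^-1


def free_energy_gap_kj(major: float, minor: float, temp_celsius: float) -> float:
    """Free-energy difference (kJ/mol) implied by a major:minor product ratio."""
    if major <= 0 or minor <= 0:
        raise ValueError("ratio components must be positive")
    temp_kelvin = temp_celsius + 273.15
    return R_GAS * temp_kelvin * math.log(major / minor) / 1000.0


# Kinetic selectivity: e.r. 98:2 at -25 °C (Table 1, entry 1).
kinetic_gap = free_energy_gap_kj(98.0, 2.0, -25.0)

# Thermodynamic preference: d.r. 1:124 for entry 8, measured at room
# temperature (25 °C is an assumption made for this sketch).
thermo_gap = free_energy_gap_kj(124.0, 1.0, 25.0)

print(f"kinetic gap for e.r. 98:2 at -25 °C: {kinetic_gap:.1f} kJ/mol")
print(f"thermodynamic gap for d.r. 124:1 at 25 °C: {thermo_gap:.1f} kJ/mol")
```

Both gaps are on the order of 10 kJ/mol, which shows that overriding a 1:124 thermodynamic preference under kinetic control requires the catalyst to discriminate between transition states by an amount comparable to the equilibrium bias itself.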

Table 1 | Catalytic asymmetric spiroacetalization

Entry  Substrate    Reaction details                          Product      Yield  e.r. or d.r.
1      2a           6a (5 mol%), -25 °C, MTBE                 (S)-1a       77%    e.r. 98:2
2      2a           ent-6a (5 mol%), -25 °C, [illegible]      (R)-1a       70%    e.r. 97.5:2.5
3      2b           6a (0.1 mol%), -55 °C, CH2Cl2             1b           62%    e.r. 96:4
4      2c           6a (1 mol%), -35 °C, DCE                  1c           81%    e.r. 95.5:4.5
5      2d           6a (1 mol%), -35 °C, MTBE                 1c           69%    e.r. 96:4
6      2e           6a (1 mol%), -25 °C, MTBE                 [illegible]  78%    e.r. 96:4
7      2f           6a (1 mol%), -35 °C, MTBE                 1f           88%    e.r. 98.5:1.5
8      (S)-2g       6a (5 mol%), -35 °C, MTBE                 1g           83%    d.r. 5:1, non-thermodynamic spiroacetal (thermodynamic d.r. 1:124)
9      [illegible]  6a (5 mol%), -35 °C, MTBE                 [illegible]  89%    d.r. 65:1
10     (R)-2i       6a (5 mol%), -35 °C, MTBE                 1i           70%    d.r. 23:1, non-thermodynamic spiroacetal (thermodynamic d.r. 1:9)
11     [illegible]  6a (5 mol%), -35 °C, MTBE                 [illegible]  70%    d.r. 50:1
12     (R)-2k       6a (10 mol%), -35 °C, MTBE                1k           76%    d.r. 7:1, non-thermodynamic spiroacetal (thermodynamic d.r. 1:60)
13     (S)-2k       6a (10 mol%), -35 °C, MTBE                1l           86%    d.r. 100:1
14     (S)-2m       6a (5 mol%), -25 °C, MTBE                 1n           70%    e.r. 97.5:2.5 (d.r. >50:1)
       (kinetic resolution)                                   (R)-2m       86%    e.r. 97:3
15     2o           6a or ent-6a (1 mol%), 20 °C, DCE         [illegible]  88%    d.r. >20:1:1:1

Reactions were run in the presence of molecular sieves. Substrates in entries 8 to 14 were examined simultaneously as racemic mixtures. The thermodynamic diastereomeric ratio (d.r.) was measured after
treating substrates with 3 M HCl(aq.) at room temperature. e.r., enantiomeric ratio; MTBE, t-butylmethyl ether; DCE, 1,2-dichloroethane.


METHODS SUMMARY 


Catalytic asymmetric synthesis of (S)-olean was performed as follows. MTBE
(tert-butylmethyl ether) (7 ml) and 4 Å molecular sieves (50 mg) were cooled
to -25 °C in a vial equipped with a septum. A solution of 2a (0.25 mmol) in
MTBE (2 ml) was added and the mixture stirred for 5-10 min. A solution of
catalyst 6a (0.0125 mmol, 5 mol%) in MTBE (1 ml) was added dropwise and the
mixture was stirred at -25 °C for 24 h. Et3N (50 µl) was added at the same
temperature, the mixture concentrated, and the product isolated by column
chromatography. For complete experimental details and characterization of
compounds, see Supplementary Information.
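
The quantities in the procedure above can be cross-checked with a short calculation. This is only a bookkeeping sketch: the helper names `mol_percent` and `concentration_molar` are hypothetical, and the concentration estimate assumes that the three MTBE portions (7 ml, 2 ml and 1 ml) are simply additive.

```python
def mol_percent(catalyst_mmol: float, substrate_mmol: float) -> float:
    """Catalyst loading in mol% relative to the substrate."""
    return 100.0 * catalyst_mmol / substrate_mmol


def concentration_molar(amount_mmol: float, volumes_ml: list[float]) -> float:
    """Concentration in mol/l; mmol per ml is numerically equal to mol per l."""
    return amount_mmol / sum(volumes_ml)


substrate_mmol = 0.25    # hydroxyenol ether 2a
catalyst_mmol = 0.0125   # imidodiphosphoric acid 6a
mtbe_ml = [7.0, 2.0, 1.0]  # initial charge, substrate solution, catalyst solution

loading = mol_percent(catalyst_mmol, substrate_mmol)
substrate_conc = concentration_molar(substrate_mmol, mtbe_ml)

print(f"catalyst loading: {loading:.1f} mol%")
print(f"substrate concentration: {substrate_conc:.3f} M")
```

The computed loading agrees with the 5 mol% stated in the procedure.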

Uncovering the Neoproterozoic carbon cycle


D. T. Johnston, F. A. Macdonald, B. C. Gill, P. F. Hoffman & D. P. Schrag


Interpretations of major climatic and biological events in Earth
history are, in large part, derived from the stable carbon isotope
records of carbonate rocks and sedimentary organic matter.
Neoproterozoic carbonate records contain unusual and large negative
isotopic anomalies within long periods (10-100 million years)
characterized by δ13C in carbonate (δ13Ccarb) enriched to more than
+5 per mil. Classically, δ13Ccarb is interpreted as a metric of the
relative fraction of carbon buried as organic matter in marine
sediments, which can be linked to oxygen accumulation through the
stoichiometry of primary production. If a change in the isotopic
composition of marine dissolved inorganic carbon is responsible for
these excursions, it is expected that records of δ13Ccarb and δ13C in
organic carbon (δ13Corg) will covary, offset by the fractionation
imparted by primary production. The documentation of several
Neoproterozoic δ13Ccarb excursions that are decoupled from δ13Corg,
however, indicates that other mechanisms may account for these
excursions. Here we present δ13C data from Mongolia, northwest Canada
and Namibia that capture multiple large-amplitude (over 10 per mil)
negative carbon isotope anomalies, and use these data in a new
quantitative mixing model to examine the behaviour of the
Neoproterozoic carbon cycle. We find that carbonate and organic carbon
isotope data from Mongolia and Canada are tightly coupled through
multiple δ13Ccarb excursions, quantitatively ruling out previously
suggested alternative explanations, such as diagenesis or the presence
and terminal oxidation of a large marine dissolved organic carbon
reservoir. Our data from Namibia, which do not record isotopic
covariance, can be explained by simple mixing with a detrital flux of
organic matter. We thus interpret δ13Ccarb anomalies as recording a
primary perturbation to the surface carbon cycle. This interpretation
requires the revisiting of models linking drastic isotope excursions to
deep ocean oxygenation and the opening of environments capable of
supporting animals.

There are two leading hypotheses for the documented large-amplitude
Neoproterozoic carbon isotope anomalies. The first proposal is that the
Neoproterozoic deep ocean carried a massive dissolved organic carbon
(DOC) reservoir. This model stems from the observation of unlinked
changes in δ13Ccarb and δ13Corg, with the postulated large DOC pool
allowing for δ13Corg records to be buffered against isotopic change.
Another view of these large carbon isotopic excursions is represented by
a set of hypotheses arguing for the infidelity of δ13Ccarb records. In
this view, secondary alteration by meteoric waters or burial diagenesis
is invoked to satisfy carbon isotope decoupling. Given such disparate
proposals for explaining the Neoproterozoic carbon isotope record, and
the implications for surface environments, climate, oxygen and animals,
we aim to revisit the behaviour of the marine carbon cycle and test the
validity of the above proposals with a new data set and model.

We sampled stratigraphic sections spanning the mid-Neoproterozoic
Cryogenian glacial intermission (<717 to >635 million years ago) at
high resolution in Mongolia, northern Namibia and northwest Canada
(see Supplementary Information). In Mongolia, limestone of the Tayshir
member (of the Tsagaan Oloom Formation) is bracketed by the Maikhan Ul
and Khongoryn diamictites (Fig. 1). The Tayshir carbon isotope anomaly
is recorded in stratigraphic sections throughout the basin, commences
with a distinct flooding surface, and is developed predominantly in
limestone micrite with relatively low primary porosity. The Tayshir
anomaly can either be correlated to moderately negative values in the
Gruis Formation of northern Namibia and Bonahaven Dolomite of the
British-Irish Caledonides, or to the Trezona anomaly in Australia and
Namibia. The Trezona anomaly was also sampled through the Ombaatjie
Formation in northern Namibia (see Supplementary Information), which
directly underlies an erosion surface related to the Ghaub glaciation of
635.5 ± 1.2 million years ago. We further sampled the Cryogenian record
from between the glaciations in the Mackenzie Mountains of northwest
Canada (see Supplementary Information), where mixed carbonate and
siliciclastic rocks of the Twitya and Keele formations are bounded by
diamictites of the Sturtian-age Rapitan Group (716.47 ± 0.24 million
years old) and Marinoan-age (~635 million years old) Ice Brook
Formation. These strata preserve the post-Sturtian Rasthof anomaly in
the Twitya Formation and an anomaly at the top of the Keele Formation
that can be correlated to the Trezona anomaly. Although there are
alternative published correlations for particular anomalies described
above, these correlations are not central to our argument.

We present two δ13Ccarb and δ13Corg records through the Tayshir
member of Mongolia (Fig. 1). The overall total organic carbon (TOC)
contents broadly follow siliciclastic content and are highest at the base
of the more distal section (Uliastay Gol), but remain at about 0.08%
throughout the succession with no apparent facies dependence (see
Supplementary Information). In the Uliastay Gol section (n = 171
pairs), an over 10‰ negative δ13Ccarb anomaly (the Tayshir) at a depth
of around 150 m preserves an extremely tight coupling with δ13Corg
through the entire excursion. Isotopic covariance is observed at even
finer stratigraphic scales, such as the inflection recorded at around
50 m depth. Consistent with this, a parallel section 35 km to the east
also preserves a tight isotopic coupling and similar net isotopic offset,
εTOC = δ13Ccarb - δ13Corg (Fig. 1; see Supplementary Information).
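
The offset defined above is a per-sample difference, so checking for coupling amounts to computing εTOC for each carbonate/organic pair and asking how strongly the two records covary. The sketch below illustrates this with invented numbers: the arrays `d13c_carb` and `d13c_org` are hypothetical values chosen for demonstration and are not data from this study, and the helper names are likewise illustrative.

```python
from math import sqrt


def epsilon_toc(d13c_carb: float, d13c_org: float) -> float:
    """Net isotopic offset between paired carbonate and organic carbon (per mil)."""
    return d13c_carb - d13c_org


def pearson_r(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical paired measurements (per mil, VPDB) through a negative excursion.
d13c_carb = [5.0, 2.0, -3.0, -6.0, 1.0]
d13c_org = [-24.0, -27.5, -32.0, -35.5, -28.0]

offsets = [epsilon_toc(c, o) for c, o in zip(d13c_carb, d13c_org)]
r_value = pearson_r(d13c_carb, d13c_org)

print("epsilon_TOC per sample:", offsets)
print(f"Pearson r between carbonate and organic records: {r_value:.3f}")
```

In a tightly coupled record of the kind described for Mongolia, the offsets stay nearly constant while both records swing through the excursion, so the correlation coefficient is close to 1.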

In northwest Canada, the Cryogenian Twitya and Keele formations
provide a comparison to glacial intermission records from Mongolia,
and preserve the Rasthof and a latest-Cryogenian isotope excursion
(Fig. 2). The post-Sturtian Twitya Formation preserves strong isotopic
covariance between carbonate and organic carbon, and an εTOC value
that closely approximates what is extracted from Mongolia (see
Supplementary Information). The pre-Marinoan Keele Formation
also appears to preserve carbon isotope covariance, but records a more
variable εTOC, ranging from values similar to those observed in
Mongolia to that recorded in the Pleistocene epoch (see Supplementary
Information). In contrast to these records of tight covariance
is the Trezona anomaly in Namibia (a TOC-poor dolomudstone,
dolograinstone and dolomicrobialaminite; see Supplementary
Information), in which δ13Ccarb and δ13Corg have little relationship to
one another. This latter observation is consistent with much of the
published Neoproterozoic data. Our data set thus reflects both the
convention (isotopic decoupling) as well as strong evidence of isotopic
covariance.
Figure 1 | Chemostratigraphic sections through
the Tayshir member in the Uliastay and Tsagaan 


We used these data to evaluate the late-Cryogenian carbon cycle. The
classic interpretation of isotopic covariance suggests that biomass is
synthesized from a relatively well-mixed marine dissolved inorganic
carbon (DIC) reservoir and results in an εTOC value that reflects a
kinetic fractionation associated with carbon fixation. This simple
depiction of the carbon cycle cleanly explains records from Mongolia and
northwest Canada (Figs 1 and 2). Our finding contrasts with models
requiring a large DOC pool, which preclude covariance between δ13Ccarb
and δ13Corg. That is, data from Mongolia and northwest Canada are
quantitatively inconsistent with a large DOC model (Figs 1 and 2). It
has also been argued that burial diagenesis or meteoric alteration could
reset the primary δ13Ccarb, calling into question the robustness of
carbonate records and their use as a pre-fossil correlation tool. Rather
than targeting the fidelity of δ13Corg, these studies emphasize the
tight covariance between δ13Ccarb and δ18Ocarb through certain negative
δ13Ccarb excursions as evidence for alteration (see Supplementary
Information). However, the observation of a tight coupling between
δ13Ccarb and δ13Corg through several Neoproterozoic anomalies (Figs 1
and 2) rules out alteration as the primary cause of all large
Neoproterozoic negative δ13Ccarb excursions (see Supplementary
Information). The δ18Ocarb in these rocks has almost certainly been
reset, given fluid-rock exchange and the high concentration of oxygen in
water, but explaining how exchange can reset δ13Ccarb given the enormous
mass of carbon in carbonate sequences is more challenging. The
correlation of δ13Ccarb and δ18Ocarb is intriguing and may in fact have
an environmental/diagenetic explanation (see Supplementary Information),
but requires further investigation. Importantly, however, diagenesis is
not responsible for the excursions through which δ13C covaries (Figs 1
and 2).

A complete understanding of the Neoproterozoic carbon cycle
requires not only the interpretation of carbon isotope covariance
(Figs 1 and 2), but also an explanation for the data that underpinned
these alternative hypotheses. We propose that much of the existing
δ13Corg data from Neoproterozoic sedimentary rocks reflects
contamination with a secondary source of organic carbon, possibly from
the erosion of organic-rich shale on land or the migration of
hydrocarbons within the basin. The isotopic contribution from exogenous
sources would be most evident when TOC values are low, consistent with a
majority of the published data. This hypothesis does not require
that all samples with low TOC carry anomalous δ13Corg, but notes the
greater susceptibility of these samples and challenges their universal
inclusion in environmental interpretations, especially when δ13Corg
remains constant through intervals when δ13Ccarb is variable.

To explore the hypothesis that these data are produced from both an
exogenous component and a primary source, we present a model for
two-component mixing (Fig. 3a, b). Here, each component carries an
organic carbon flux (monitored using TOC) and a distinct δ13Corg, so
the product of mixing does not require a linear relationship between
components (see Supplementary Information). One component in the
model is organic carbon from primary producers (A in Fig. 3a), with a
high mass fraction and an isotopic composition offset from dissolved
inorganic carbon by εTOC. The second component (B in Fig. 3a) is
exogenous, and could be terrigenous (detrital) inputs, secondary
hydrocarbons, or contamination at some point in the rock's life history
(with an isotopic composition in the range of typical organic matter).
Post-depositional processes, such as heterotrophic remineralization,
will also alter mixed contributions (Fig. 3b). When then compared
to δ13Ccarb (Fig. 3c, d), this simple mixing model fully describes
Cryogenian and Ediacaran records without invoking alteration
or unfamiliar ocean chemistry. Our model does not require
developing and oxidizing an ocean where DOC overwhelms DIC
multiple times throughout the Neoproterozoic era, or driving extensive
global alteration of carbonate platform sediments. This does not
preclude contributions from DOC, but the observation of isotopic
covariance at the scales observed requires DOC to be significantly
subordinate to DIC. Therefore, we find our model more plausible
from a physical and chemical oceanographic standpoint, because it
requires only established, well-understood processes (that is,
weathering, a circulating ocean and remineralization). It follows from
these conclusions that δ13Ccarb may be a more faithful indicator of
environmental change, and δ13Corg should be viewed critically,
especially when TOC is low and invariant.

Figure 2 | A chemostratigraphic section through the late Cryogenian period
of northwest Canada. As in Fig. 1, red circles represent carbonate carbon and
blue denotes paired organic carbon (n = 73). Isotope scales are equal for
carbonate and organic carbon, and the 2-s.d. errors are smaller than the data
points. Stratigraphic keys are as in Fig. 1, as are the TOC (wt%) calibrations.
Abbreviations correspond to the following: Ra, Rapitan Group; T, Twitya; I, Ice
Brook. Carbonate data in the Keele and Twitya formations are from ref. 21 and
overall trends are consistent with ref. 4.
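
The two-component idea described above can be illustrated with a standard mass-weighted mixing calculation. This is a simplified sketch and not the model of the Supplementary Information: it ignores remineralization, and every number in it (the offset of 28‰, the exogenous composition of -25‰ and the TOC contributions) is a hypothetical value chosen for demonstration.

```python
def mixed_d13c_org(primary_toc: float, primary_d13c: float,
                   exogenous_toc: float, exogenous_d13c: float) -> float:
    """Mass-weighted d13C of organic carbon from a primary and an exogenous source."""
    total = primary_toc + exogenous_toc
    if total <= 0:
        raise ValueError("total organic carbon must be positive")
    return (primary_toc * primary_d13c + exogenous_toc * exogenous_d13c) / total


EPSILON_TOC = 28.0      # assumed offset between DIC and primary biomass, per mil
EXOGENOUS_TOC = 0.05    # assumed constant detrital contribution, wt%
EXOGENOUS_D13C = -25.0  # assumed composition of the detrital carbon, per mil


def organic_shift(primary_toc: float, carb_before: float, carb_after: float) -> float:
    """Change in bulk d13C_org when d13C_carb moves from carb_before to carb_after."""
    before = mixed_d13c_org(primary_toc, carb_before - EPSILON_TOC,
                            EXOGENOUS_TOC, EXOGENOUS_D13C)
    after = mixed_d13c_org(primary_toc, carb_after - EPSILON_TOC,
                           EXOGENOUS_TOC, EXOGENOUS_D13C)
    return before - after


# A 15 per mil drop in d13C_carb (from +5 to -10) seen through two kinds of rock.
shift_high_toc = organic_shift(primary_toc=1.0, carb_before=5.0, carb_after=-10.0)
shift_low_toc = organic_shift(primary_toc=0.01, carb_before=5.0, carb_after=-10.0)

print(f"organic shift with abundant primary carbon: {shift_high_toc:.1f} per mil")
print(f"organic shift with scarce primary carbon:   {shift_low_toc:.1f} per mil")
```

With abundant primary organic carbon the bulk organic record follows almost the whole carbonate excursion, whereas in a TOC-poor sample the constant exogenous component dominates and the organic record barely moves, which is the decoupled pattern the text attributes to low-TOC samples.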

Given that 38°C. may well faithfully preserve primary environ- 
mental information, we revisit the reading of this record. The conven- 
tional interpretation of the carbon cycle and isotope records uses a 
simplification of the full carbon flux equation for isotopic mass balance, 
where 5'°C,o) = [8 Cog X forg] + [5 '°Coarp X (1 — forg)], and forg is the 
relative fraction of carbon buried as organic matter. In the full carbon 
flux equation, 5'°C inputs reflect a weighted average of volcanism 
(carbon entrained during subduction and mantle outgassing) and 
carbonate and organic carbon weathering”, and the sinks include 
both carbonate and organic matter burial. So if we consider the full 
carbon flux equation, other ways of driving 5'°C variations become 
apparent. Each of the above fluxes can change in magnitude and iso- 
topic composition, and each carry specific consequences for climate and 
oxidant budgets. Also, the Neoproterozoic 38°C excursions occur in 
tens to hundreds of metres of shallow-water strata (third-order 
sequence tracts), implying a characteristic timescale of 10° to 10° years 
(ref. 27). It is challenging to produce such large (about 10%bo) isotopic 
excursions, which in certain cases plunge beyond the canonical mantle 
value, on these timescales without the input of an enormous mass of 
low-8'°C carbon, presumably in the form of either organic matter or 
methane”. This requisite light carbon load may, in part, reflect the mass- 
balance counterpart to enriched background Cryogenian 82 Cub 
values. Such an injection of carbon would also significantly raise atmo- 
spheric pco, or pcx, and consume existing oxidant reservoirs”, which is 
difficult to reconcile with the close stratigraphic association of at least 
some of the excursions and glacial deposits*®. Another possibility is that 
the oxidation of reduced carbon is accomplished with SO₄²⁻ or Fe
oxides as the electron acceptor’, rather than O₂, which may allow for
some stabilization of pCO₂ through the effect on alkalinity”. If the duration
of these events is found to be longer, then carbon mass input arguments
should be modified accordingly, but the relevant reservoirs
interacting with the surface carbon cycle remain unchanged. 
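
As an illustrative aside, the simplified mass balance quoted above can be rearranged to give the organic burial fraction implied by a set of isotopic compositions. The Python sketch below does only that. The function name and the numerical values (an input of -5‰, carbonate at +5‰ and organic carbon at -25‰) are assumptions chosen for illustration, not measurements from this study.

```python
def organic_burial_fraction(d13c_input, d13c_carb, d13c_org):
    """Solve d13c_input = d13c_org*f_org + d13c_carb*(1 - f_org) for f_org."""
    spread = d13c_carb - d13c_org
    if spread == 0:
        raise ValueError("carbonate and organic values must differ")
    return (d13c_carb - d13c_input) / spread


# Assumed, illustrative values in per mil (VPDB); not data from the paper.
f_org = organic_burial_fraction(d13c_input=-5.0, d13c_carb=5.0, d13c_org=-25.0)
print(round(f_org, 3))
```

In this simplified view, a higher carbonate value at a fixed input and organic composition implies a larger burial fraction, which is the conventional reading that the full flux equation is meant to qualify.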

The data and analysis presented here illustrate that δ¹³C_carb preserves
a faithful snapshot of the Neoproterozoic surface carbon cycle.
This suggests that there is an environmental explanation for the 
enriched δ¹³C_carb background values, and that at least some of the
10°-to-10°-year negative 8 Cink excursions reflect primary features 
of the surface carbon cycle. These anomalies thus require the episodic 
input of isotopically light carbon, presumably methane or organic 
matter, as well as the cyclic consumption and production of electron 
acceptors such as sulphate and O₂. Our data do not eliminate diagenetic
explanations for particular excursions throughout the world, but
the covariation evident during many of the excursions does suggest a 
common explanation for all of them. Whether these excursions are 
driven by the recycling of stored carbon in the form of methane 
hydrate or ancient organic matter in sediments, or by the dynamics 
of nutrient-dependent productivity in the ocean remains uncertain. 
Consistent with both is the idea that these excursions simply reflect a
series of failed transitions from the more reducing and biologically 
simple Proterozoic world towards the more oxidizing and biologically 
complex Phanerozoic Earth on which we now live. 


METHODS SUMMARY 


All isotope data are presented relative to the Vienna PeeDee Belemnite (VPDB) 
standard and calculated according to standard delta notation and reported in units 
of ‰. Carbonate carbon and oxygen isotope measurements were made as one
individual measurement, with the organic carbon isotopic composition measured 
separately, but on the same hand sample. Carbonate carbon isotopes were pre- 
pared by cutting hand samples to expose a fresh surface followed by micro-drilling 
0.001 g. A small aliquot of this powder was analysed on a VG Optima run in dual 
inlet mode, abutted to a common acid bath preparation device. Samples were 
acidified in H₃PO₄ at 90 °C. The evolved CO₂ was cryo-focused and analysed
against an in-house standard reference gas. The reproducibility of this measurement
was better than 0.1‰ (one standard deviation). Organic carbon analyses
were performed on large (about 10 g) samples to accommodate low TOC values. 
Samples were decalcified with concentrated HCl for 48 h, buffered back to a
neutral pH (pH > 5), filtered and dried. The mass of insoluble residue was taken
as siliciclastic content. Homogenized residues were analysed on a Carlo Erba 
Elemental Analyzer attached to a ThermoFinnigan Delta V configured in con- 
tinuous flow mode. Samples and standards were bracketed such that our >350 
unique organic carbon analyses (each run in duplicate) were associated with 370 
internal standards. These standards have known organic carbon contents and 
isotope values, and were used to calibrate TOC contents and isotopic composi- 
tions. Estimates of error are also reported in the text. The simple mixing model 
underpinning Fig. 3 is a standard multi-component mixing scenario. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


All isotope data are presented relative to the Vienna PeeDee Belemnite (VPDB) 
standard and calculated according to standard delta notation and reported in units 
of ‰. Carbonate carbon and oxygen isotope measurements were made as one individual
measurement, with the organic carbon isotopic composition measured
separately, but on the same hand sample. Carbonate carbon isotopes were pre- 
pared by cutting hand samples to expose a fresh surface followed by micro-drilling 
0.01 g. A small aliquot of this powder was analysed on a VG Optima run in dual 
inlet mode, abutted to a common acid bath preparation device. Samples were 
acidified in H₃PO₄ at 90 °C. The evolved CO₂ was cryo-focused and analysed
against an in-house standard reference gas. The reproducibility of this measurement
was better than 0.1‰ (one standard deviation). Organic carbon analyses
were performed on large (about 10 g) samples to accommodate low TOC values. 
Samples were decalcified with concentrated HCl for 48 h, buffered back to a
neutral pH (pH > 5), filtered and dried. The mass of insoluble residue was taken
as siliciclastic content. Homogenized residues were analysed on a Carlo Erba 
Elemental Analyzer attached to a ThermoFinnigan Delta V configured in con- 
tinuous flow mode. Samples and standards were bracketed such that our >350 
unique organic carbon analyses (each run in duplicate) were associated with 370
internal standards. These standards have known organic carbon contents and
isotope values, and were used to calibrate TOC contents and isotopic composi- 
tions. Estimates of error are reported in the text. 

The model presented in the manuscript is a multi-component mixing scenario,
herein defined by a hyperbolic relationship. The equation represents a style of
two-component mixing, where:


$$(\delta^{13}C_{TOC})_M = (\delta^{13}C_{TOC})_A \times f_A \times (X_A/X_M) + (\delta^{13}C_{TOC})_B \times (1 - f_A) \times (X_B/X_M) \qquad (1)$$


with the subscripts M, A and B representing the mixture, end-member A and
end-member B. There are two weighting factors, $f_A$ and $(1 - f_A)$, which are the
fractional terms seen in the equation describing linear mixing:


$$X_M = X_A \times f_A + X_B \times (1 - f_A) \qquad (2)$$


where X is mass. In solving this, we can choose the end-member compositions
($X_A$, $(\delta^{13}C_{TOC})_A$, $X_B$ and $(\delta^{13}C_{TOC})_B$) and allow $f_A$ to vary from 0 to 1 (this is a
relative flux term). This allows us to solve for $X_M$ (from equation (2)) and then
$(\delta^{13}C_{TOC})_M$ (from equation (1)). For application in Fig. 3, we can include the
$\delta^{13}C_{carb}$ on the same sample.
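
The two mixing equations can be evaluated directly once end-members are chosen. The Python sketch below is a minimal illustration of equations (1) and (2). The function name `mix_toc` and the end-member values are hypothetical and are not taken from the paper or from Fig. 3.

```python
def mix_toc(f_a, x_a, d13c_a, x_b, d13c_b):
    """Two-component mixing of TOC mass and isotopic composition.

    Equation (2): X_M = X_A*f_A + X_B*(1 - f_A)
    Equation (1): d13C_M = d13C_A*f_A*(X_A/X_M) + d13C_B*(1 - f_A)*(X_B/X_M)
    """
    if not 0.0 <= f_a <= 1.0:
        raise ValueError("f_a must lie between 0 and 1")
    x_m = x_a * f_a + x_b * (1.0 - f_a)
    d13c_m = d13c_a * f_a * (x_a / x_m) + d13c_b * (1.0 - f_a) * (x_b / x_m)
    return x_m, d13c_m


# Hypothetical end-members (assumed for illustration, not values from the paper).
END_A = {"x_a": 1.0, "d13c_a": -30.0}
END_B = {"x_b": 0.1, "d13c_b": -20.0}

for f_a in (0.0, 0.25, 0.5, 0.75, 1.0):
    x_m, d13c_m = mix_toc(f_a, **END_A, **END_B)
    print(f"f_A={f_a:.2f}  X_M={x_m:.3f}  d13C_M={d13c_m:.2f}")
```

Because each end-member is weighted by its mass as well as its fraction, the mixture's isotopic composition stays close to the carbon-rich end-member over most of the range of f_A, which is the hyperbolic behaviour the text describes.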
On the difficulty of increasing dental complexity 


Enni Harjunmaa, Aki Kallonen, Maria Voutilainen, Keijo Hämäläinen, Marja L. Mikkola & Jukka Jernvall 


One of the fascinating aspects of the history of life is the apparent 
increase in morphological complexity through time’, a well known 
example being mammalian cheek tooth evolution” *. In contrast, 
experimental studies of development more readily show a decrease 
in complexity, again well exemplified by mammalian teeth, in which 
tooth crown features called cusps are frequently lost in mutant and 
transgenic mice*’. Here we report that mouse tooth complexity can 
be increased substantially by adjusting multiple signalling pathways 
simultaneously. We cultured teeth in vitro and adjusted ectodysplasin 
(EDA), activin A and sonic hedgehog (SHH) pathways, all of which 
are individually required for normal tooth development. We quan- 
tified tooth complexity using the number of cusps and a topographic 
measure of surface complexity*. The results show that whereas 
activation of EDA and activin A signalling, and inhibition of SHH 
signalling, individually cause subtle to moderate increases in com- 
plexity, cusp number is doubled when all three pathways are adjusted 
in unison. Furthermore, the increase in cusp number does not result 
from an increase in tooth size, but from an altered primary pattern- 
ing phase of development. The combination of a lack of complex 
mutants”’, the paucity of natural variants with complex pheno- 
types’, and our results of greatly increased dental complexity using 
multiple pathways, suggests that an increase may be inherently dif- 
ferent from a decrease in phenotypic complexity. 

Dental complexity, typically measured as the number of tooth 
crown features such as cusps, is closely linked to an animal’s diet. 
Mammalian species with primarily plant diets have high cheek tooth, 
or molar, complexity whereas species with animal diets have low com- 
plexity, and this pattern seems to hold irrespective of phylogeny*”®. 
During molar development, cusps form at the positions of epithelial 
signalling centres, the secondary enamel knots’’. Each secondary 
enamel knot is a replay of the same developmental module, hence each 
enamel knot expresses the same set of regulatory genes’. This evidence 
indicates that mammalian molar complexity is developmentally rela- 
tively simple and adjusting the dynamics of patterning should readily 
produce both a decrease and an increase in cusp numbers. 

An increasing number of genes are known to affect molar develop- 
ment in the mouse*”’. Of the at least 29 individual gene mutations 
known, 13 cause the arrest of tooth development, 7 cause a decrease 
in cusp number, 6 cause modifications in cusp patterns, and 3 cause the 
formation of one additional cusp (Supplementary Table 1). These 
patterns contrast with multiple examples of evolutionary increase in 
cusp number? **"°, and with the paucity of evolutionary decrease in 
cusp number’*"*. Furthermore, both syndromic and normal variation 
in human dentition predominantly cause simplification or complete 
loss of teeth**"’, agreeing with the data from mice but contrasting with 
the macroevolutionary patterns. 

The prevalence of simple mutants is perhaps to be expected because 
most mutations may compromise individual signalling pathways 
required for organ development. Furthermore, the lack of complex 
mutants could be due to the relatively few gain-of-function mutations 
studied, or few mutations in feedback inhibitors of major signalling 
pathways**° (Supplementary Table 1). This missing ‘complexity gene’ 
scenario, however, is not the only plausible hypothesis because a 


substantial increase in complexity can also be proposed to require 
simultaneous changes in several signalling pathways (or multiple 
changes in a single pathway). This requirement of multiple changes 
can be conceptualized as “economics of signalling’ in which, for 
example, increasing signalling through one pathway alone may deplete 
other signals required to produce a greater number of cusps. In genetic 
terms, this would approximate a polygenic effect. 

To explore the contrasting ideas about increasing complexity, we 
tested whether adjusting individual signalling pathways alone or in 
combination could be used to increase tooth cusp number. We cul- 
tured mouse molars while adjusting three signalling pathways, EDA, 
activin A and SHH. 

EDA, a diffusible tumour necrosis factor (TNF) family protein, is 
expressed in the outer epithelium surrounding the forming crown 
and modulates the activity of enamel knots through its receptor which 
is expressed in the knot cells’®. Lack of EDA signalling approximately 
halves the number of cusps in a mouse molar, causing rounder and more 
closely spaced cusps'*’”. Superfluous EDA signalling under the keratin- 
14 (K14) promoter causes opposite changes in cusp shape and spacing, 
but only a subtle increase in cusp number”. Activin βA (also known as
inhibin βA) in turn is mesenchymally expressed underneath the enamel
knots, and mediates epithelial-mesenchymal interactions required for 
tooth development”®. Lack of activin A arrests lower molar development 
before the formation of the primary enamel knot'®. Superfluous activin 
A applied next to a developing second molar accelerates its develop- 
ment!’, but the effects on cusp numbers are unknown. SHH has been 
implicated by computational modelling to laterally inhibit formation of 
enamel knots”, and experimental studies in mouse have shown SHH to 
inhibit formation of cusps and fusion of adjacent teeth*’”*. Whereas Shh 
is initially expressed only in the enamel knots, the SHH pathway is 
principally active in the surrounding tissue”’. Total lack of SHH signal- 
ling causes the arrest of tooth development after the formation of the 
primary enamel knot”, but the more subtle effects of SHH on cusp 
number remain to be quantified. 

To tabulate the appearance of enamel knots and cusps in culture 
conditions, we cultured lower first molar tooth germs of mice that express 
green fluorescent protein (GFP, fused with Cre-recombinase) in the 
Shh locus (hereafter called ShhGFP mice) as previously described’. 
Whereas Shh is initially expressed in the enamel knots, during differ- 
entiation expression spreads throughout the inner enamel epithelium, 
so the epifluorescence of ShhGFP can be used to monitor the
development of both cusp number and crown shape (Fig. 1a). First, 
we determined whether EDA, activin A and SHH can be adjusted 
individually to increase cusp number. We cultured molars starting at 
embryonic day 13, just before the primary enamel knot starts to appear. 
Teeth were treated for 4 days, covering the period of cusp patterning of 
the first molar, and cusp numbers were tabulated after 8 days in culture 
(Methods and Supplementary Information). 

The results show that the effects of EDA are subtle, lacking apparent 
increase in cusp number (Fig. 1b). Because the culture conditions limit 
the detection of phenotypic details, we additionally tested the effects of 
EDA protein on Eda-null mutants which typically develop two to three 
cusps’’. In these cultures the wild-type cusp number was reached at 
Figure 1 | Cusp number can be increased in cultured teeth. a, Culturing 
ShhGFP molar explants from day 13 allows monitoring of the cusp patterning 
and testing the effects of proteins in the culture medium. For example, activin A 
causes the appearance of additional cusps lingually (arrowheads). b, Culturing 
teeth with EDA (0.5 µg ml⁻¹), activin A (0.5 µg ml⁻¹) and cyclopamine (2 µM)
in the medium causes larger effects on patterning, including additional cusps 
(arrowheads) when administered in combinations. A dashed line shows the 
border between the first and the second molars. Teeth were treated for 4 days 
and cusps were tabulated at 8 days. Anterior is towards the left, buccal towards 
the top. M1, first molar; M2, second molar. Scale bar, 0.5 mm. 


0.1 µg ml⁻¹, and higher EDA concentrations increased crown size only
(Supplementary Fig. 1 and Supplementary Table 2). Activin A, in 
contrast, readily caused the formation of additional cusps although 
high concentrations interfered with growth (Supplementary Fig. 1 and 
Supplementary Table 2). New cusps were added lingually (Fig. 1), 
which is also the side that grows more during early molar formation". 
The three-row cusp configuration of activin A-treated molars is remin- 
iscent of, for example, molars of multituberculate mammals. For SHH, 
we first confirmed the polarity of SHH signalling in regulating cusps 
by culturing teeth with either SHH protein or the alkaloid SHH 
inhibitor cyclopamine*’*®. The results show that SHH stalls and 
cyclopamine promotes cusp development, hence SHH signalling 
seems to function as an inhibitor of cusp patterning (Supplementary 
Fig. 2). Cyclopamine also reduced crown size (Supplementary Fig. 1 
and Supplementary Table 2), agreeing with the role of SHH in pro- 
moting growth and cell survival’*”*””. 

To maximize cusp numbers while minimally impeding growth 
using EDA, activin A and cyclopamine, we determined the optimum 
concentrations to be 0.5 µg ml⁻¹, 0.5 µg ml⁻¹ and 2 µM, respectively
(Supplementary Table 2 and Fig. 2a). The corresponding average cusp 
numbers are 5.4, 7.8 and 6.8 (control being 5.6), and changes in crown 
size are +3%, −17% and −44%, respectively. Next we tested the combined
effects of these concentrations of EDA, activin A and cyclopamine
on cusp number. 

Compared to the single treatments, double-combination cultures 
showed a greater number of cusps (average cusp numbers are 8.0, 
9.0 and 10.1, Figs 1b and 2a), with the triple combination producing 
the highest number of cusps, effectively doubling the wild-type cusp 
number (average cusp number is 12.5, Figs 1b and 2a, Supplemen- 
tary Tables 3 and 4). Additionally, the cyclopamine, double and triple 
Figure 2 | Number and density of cusps increase with the number of 
adjusted signalling pathways. a, b, Number of cusps calculated for the first 
molars (a) and combined first and second molars (b). In vivo first molar (a) and 
combined first and second molar (b) cusp numbers are marked with dotted 
horizontal lines. c, d, Cusp densities calculated for the first molars (c) and 
combined first and second molars (d) show the largest increase using 
cyclopamine and the smallest increase using EDA. Boxes enclose 50% of 
observations; the median and mean are indicated with a horizontal bar and 
circle, respectively, and whiskers denote range. n = 10 for each treatment, 
except (due to uncertain cusp identity) n = 8 for A,C and E,C in (a), A,C and 
E,A,C in (c), and n = 7 for E,C in (c). Co, control; E, EDA; A, activin A; C, 
cyclopamine. 


treatments caused fusion of the first and second molars (Fig. 1b). 
Whereas the daily monitoring of tooth cultures allowed us in most 
cases (76 out of 80) to identify cusps belonging to each molar, we also 
tabulated the results using the combined first and second molar cusp 
numbers (Fig. 2b). Both tabulations show that treatment effects are 
moderately additive, and correlations between the number of manipulated
pathways and the number of cusps are r_s = 0.79 for the first molar
and r_s = 0.85 for the combined first and second molars (both
P < 0.001, nonparametric Spearman correlation, Supplementary Fig. 3
and Supplementary Table 5). Even though the second molar does not 
normally develop well in culture conditions due to inhibition from the 
first molar’’, the double and triple treatments produced more cusps than 
in erupted first and second molars combined (Fig. 2b). Furthermore, the 
triple-treatment molars frequently exceed the cusp number of the 
complete mouse molar row. 
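
To make the rank correlation concrete, the Python sketch below computes a Spearman coefficient between the number of adjusted pathways and the average first-molar cusp numbers quoted in the text (5.6 for the control; 5.4, 7.8 and 6.8 for single treatments; 8.0, 9.0 and 10.1 for double treatments; 12.5 for the triple treatment). This is an illustration only: it uses eight treatment means rather than the individual teeth, so it is not expected to reproduce the reported r_s values, and the helper names are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Number of adjusted pathways per treatment and the mean cusp numbers from the text.
pathways = [0, 1, 1, 1, 2, 2, 2, 3]
mean_cusps = [5.6, 5.4, 7.8, 6.8, 8.0, 9.0, 10.1, 12.5]
print(round(spearman(pathways, mean_cusps), 2))
```

Ties in the pathway count are handled with average ranks, which is why the coefficient is computed as a correlation of ranks rather than with the shortcut formula that assumes no ties.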

Previously cusp number has been experimentally increased by 
recombining mouse molar epithelium with the larger molar mesenchyme 
of the rat”*, and by tissue engineering protocols recombining large 
quantities of mouse dental tissues”’. In both cases teeth become large 
without changing the density of cusps’’’*. Because evolutionary 
increase in cusp number has occurred irrespective of size®, and because 
species differences in cusp number appear during the early patterning 
phase of development"', we tabulated cusp densities in our cultured 
teeth. The results show that increases in cusp number are attained 
through higher packing of cusps, cyclopamine causing the highest 
increase in cusp density (Fig. 2c, d). Taken together, downregulating 
SHH signalling decreases inhibition between enamel knots, whereas 
EDA and especially activin A promote enamel knot formation. These 
somewhat complementary effects of the three pathways suggest that 
overcoming bottlenecks in an individual pathway by simultaneously 
adjusting multiple steps (signalling molecule, receptor and intracellular 
components) might be less effective in increasing complexity. 
Nevertheless, our subtle results from EDA-treated molars could be 
due to downstream bottlenecks, especially because molar growth 
Figure 3 | Tomography reveals high surface complexity of treated molars. 
a, Epifluorescence of ShhGFP teeth grown without (top) and with EDA, activin 
A and cyclopamine (bottom) in culture medium. b, Three-dimensional 
reconstructions of the future enamel-dentin junction from soft tissue X-ray 
microtomographs reveal a close correspondence to the epifluorescence of the 
epithelium (a). c, Slope orientation maps (orientations indicated on the colour 
wheel) used to calculate OPC values (number of coloured patches). Anterior is 
towards the left. Scale bar, 0.2 mm. 


remains uncompromised. However, when we simultaneously increased 
EDA and its receptor EDAR by crossing mice overexpressing both 
genes, the results remained subtle, a distinct new feature being a cingulid,
a girdle-like structure present in many species but missing in the mouse
(Supplementary Fig. 4). These results in turn indicate that EDA affects
complexity by fine-tuning crown-width-linked features.

The high density of cusps attained raises the question about the 
maximum possible cusp density. To examine our cultured teeth in 
detail, we made soft tissue X-ray microtomography scans” of selected 
teeth and reconstructed three-dimensional shapes of the future 
enamel-dentin junctions. The reconstructions show that images of 
cultured ShhGFP teeth are representative of the cusp patterns 
(Fig. 3a, b). The average cusp diameter of the triple-treatment tooth 
is 107 µm (range from 58 to 190 µm), so cusps are roughly 10
cells across in the mesenchyme. In principle a limited number of
additional cusps could fit in if the largest cusps became smaller, a 
change likely to require further changes in signalling. 

Finally, we analysed the three-dimensional reconstructions using 
the orientation patch count (OPC) measure of dental complexity’. 
OPC depicts the number of slopes distinguished by differences in slope 
orientation irrespective of slope steepness, thereby providing a close 
correspondence between enamel-dentin junction and tooth surface*”. 
The high cusp count in the triple-treatment tooth is reflected in the 
high OPC (Fig. 3c), and the crown topography is comparatively similar 
to the third molar of the giant panda analysed previously*. On the basis 
of OPC values among mammals, including rodents’, altering mouse 
molar patterning produced complexities associated with fibrous plant 
specializations. 
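
The idea behind an orientation patch count can be sketched in a few lines of Python: assign each cell of a height grid to an orientation class based on the direction of its slope, ignoring steepness, and count the contiguous patches that share a class. The code below is a simplified illustration under stated assumptions, not the published OPC procedure. The eight orientation classes, the minimum patch size of three cells, the bin alignment and the function name are all assumptions.

```python
import math


def orientation_patch_count(height, n_bins=8, min_patch=3):
    """Count contiguous patches of cells sharing a slope-orientation class."""
    rows, cols = len(height), len(height[0])
    sector = 2.0 * math.pi / n_bins
    label = [[None] * cols for _ in range(rows)]
    # Classify interior cells by downslope direction (central differences).
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            dzdx = (height[r][c + 1] - height[r][c - 1]) / 2.0
            dzdy = (height[r + 1][c] - height[r - 1][c]) / 2.0
            if dzdx == 0 and dzdy == 0:
                continue  # flat cell: no orientation
            aspect = math.atan2(-dzdy, -dzdx) % (2.0 * math.pi)
            label[r][c] = int(aspect / sector) % n_bins
    # Flood-fill 4-connected cells with the same class and count the patches.
    seen = [[False] * cols for _ in range(rows)]
    patches = 0
    for r in range(rows):
        for c in range(cols):
            if label[r][c] is None or seen[r][c]:
                continue
            stack, size = [(r, c)], 0
            seen[r][c] = True
            while stack:
                y, x = stack.pop()
                size += 1
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols and not seen[ny][nx]
                            and label[ny][nx] == label[r][c]):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            if size >= min_patch:
                patches += 1
    return patches


# Toy surfaces: a tilted plane (one slope) and a ridge (two opposing slopes).
plane = [[float(c) for c in range(5)] for _ in range(5)]
ridge = [[-abs(c - 3.0) for c in range(7)] for _ in range(5)]
print(orientation_patch_count(plane), orientation_patch_count(ridge))
```

Because only slope direction enters the classification, a steep cusp and a shallow one contribute equally, which matches the description of the measure as independent of slope steepness.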

In conclusion, the lack of complex mutants, the high complexity 
attained adjusting more than one pathway, and the lack of comparable 
variation in natural populations, all point to a general bias against 
increase in dental complexity. Yet, the predominant macroevolutionary 
patterns indicate that ecological factors have been more than adequate in 
increasing dental complexity. One possible effect of the bias against new 
cusps could be that, everything else being equal, an increase in dental 
complexity is slower than a decrease. Of course, yet to be uncovered 
‘complexity genes’ may individually increase cusp numbers. Until 
further evidence emerges, however, we suggest that teeth, and biological structures
in general, tend to follow an economics of signalling in which an
increase in complexity beyond the normal variation present in a popu- 
lation requires multiple changes in developmental regulation. 

METHODS SUMMARY 


Lower molar tooth germs were dissected from heterozygous ShhGFP mouse 
embryos” at day 13 post fertilization as described previously’’. Explants were 
photographed daily until day eight of culture, and cusp numbers and identities 
were followed using the epifluorescence marking the formation of each enamel 
knot and cusp. We modified the activity of signalling pathways extracellularly, 
which allowed relatively precise control of timing and dosage in experimental 
settings. EDA, activin A and SHH signalling pathways were adjusted first indi- 
vidually to determine the concentrations to maximally increase cusp number. 
Next, teeth were cultured with combinations of EDA, activin A and SHH inhibitor 
cyclopamine. All treatments were done by adding the ligand or inhibitor into the 
culturing medium for the first 4 days in culture. Ten molars were used for each 
treatment and developmental characteristics (cusp number, tooth size and cusp 
identity) were identified and measured from images using ImageJ 1.43
(http://rsbweb.nih.gov/ij/) and analysed using PAST
(http://folk.uio.no/ohammer/past/index.html). The subtle effects of EDA signalling on crown morphology were
tested further by using both Eda-null mutants and mice overexpressing Eda and 
its receptor. X-ray tomography, with a custom-built µCT system Nanotom 180
NE, was used to obtain three-dimensional scans of cusp patterns (with an effective
voxel size of 1 µm). ImageJ was used to manually segment the mesenchyme from
the epithelium, and the ImageJ 3D Viewer plugin was used to render the
epithelium-mesenchyme interface. OPC, with 50 data rows in a tooth, was calculated as
previously described (ref. 8).


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 28 September 2011; accepted 17 January 2012. 
Published online 7 March 2012. 


1. Carroll, S. B. Chance and necessity: the evolution of morphological complexity and
diversity. Nature 409, 1102-1109 (2001).

2. Hunter, J. P. & Jernvall, J. The hypocone as a key innovation in mammalian
evolution. Proc. Natl Acad. Sci. USA 92, 10718-10722 (1995).

3. Kielan-Jaworowska, Z., Cifelli, R. L. & Luo, Z.-X. Mammals from the Age of Dinosaurs:
Origins, Evolution, and Structure (Columbia Univ. Press, 2004).

4. Luo, Z.-X. Transformation and diversification in early mammal evolution. Nature
450, 1011-1019 (2007).

5. Bei, M. Molecular genetics of tooth development. Curr. Opin. Genet. Dev. 19,
504-510 (2009).

6. Nieminen, P. Genetic basis of tooth agenesis. J. Exp. Zool. B 312B, 320-342
(2009).

7. Charles, C. et al. Modulation of Fgf3 dosage in mouse and men mirrors evolution of
mammalian dentition. Proc. Natl Acad. Sci. USA 106, 22364-22368 (2009).

8. Evans, A. R., Wilson, G. P., Fortelius, M. & Jernvall, J. High-level similarity of
dentitions in carnivorans and rodents. Nature 445, 78-81 (2007).

9. Miles, A. E. W. & Grigson, C. Colyer’s Variations and Diseases of the Teeth of Animals
(Cambridge Univ. Press, 2003).

10. Santana, S. E., Strait, S. & Dumont, E. R. The better to eat you with: functional
correlates of tooth structure in bats. Funct. Ecol. 25, 839-847 (2011).

11. Jernvall, J., Keränen, S. V. E. & Thesleff, I. Evolutionary modification of development
in mammalian teeth: quantifying gene expression patterns and topography. Proc.
Natl Acad. Sci. USA 97, 14444-14448 (2000).

12. Kay, R. F. “Giant” tamarin from the Miocene of Colombia. Am. J. Phys. Anthropol. 95,
333-353 (1994).

13. Uhen, M. D. Evolution of marine mammals: back to the sea after 300 million years.
Anat. Rec. 290, 514-522 (2007).

14. Gabris, K., Fabian, G., Kaan, M., Rozsa, N. & Tarjan, I. Prevalence of hypodontia and
hyperdontia in paedodontic and orthodontic patients in Budapest. Community
Dent. Health 23, 80-82 (2006).

15. Mikkola, M. L. TNF superfamily in skin appendage development. Cytokine Growth
Factor Rev. 19, 219-230 (2008).

16. Grüneberg, H. Genes and genotypes affecting the teeth of the mouse. J. Embryol.
Exp. Morphol. 14, 137-159 (1965).

17. Kangas, A. T., Evans, A. R., Thesleff, I. & Jernvall, J. Nonindependence of
mammalian dental characters. Nature 432, 211-214 (2004).

18. Ferguson, C. A. et al. Activin is an essential early mesenchymal signal in tooth
development that is required for patterning of the murine dentition. Genes Dev. 12,
2636-2649 (1998).

19. Kavanagh, K. D., Evans, A. R. & Jernvall, J. Predicting evolutionary patterns of
mammalian teeth from development. Nature 449, 427-432 (2007).

20. Salazar-Ciudad, I. & Jernvall, J. A computational model of teeth and the
developmental origins of morphological variation. Nature 464, 583-586 (2010).

21. Cho, S.-W. et al. Interactions between Shh, Sostdc1 and Wnt signaling and a new
feedback loop for spatial patterning of the teeth. Development 138, 1807-1816
(2011).

22. Ahn, Y., Sanderson, B. W., Klein, O. D. & Krumlauf, R. Inhibition of Wnt signalling by
Wise (Sostdc1) and negative feedback from Shh controls tooth number and
patterning. Development 137, 3221-3231 (2010).

©2012 Macmillan Publishers Limited. All rights reserved 


23. Dassule, H. R., Lewis, P., Bei, M., Maas, R. & McMahon, A. P. Sonic hedgehog
regulates growth and morphogenesis of the tooth. Development 127, 4775-4785
(2000).

24. Harfe, B. D. et al. Evidence for an expansion-based temporal Shh gradient in
specifying vertebrate digit identities. Cell 118, 517-528 (2004).

25. Taipale, J. et al. Effects of oncogenic mutations in Smoothened and Patched can be
reversed by cyclopamine. Nature 406, 1005-1009 (2000).

26. Chen, J. K., Taipale, J., Cooper, M. K. & Beachy, P. A. Inhibition of Hedgehog
signalling by direct binding of cyclopamine to Smoothened. Genes Dev. 16,
2743-2748 (2002).

27. Ishida, K. et al. The regulation of tooth morphogenesis is associated with epithelial
cell proliferation and the expression of Sonic hedgehog through epithelial-
mesenchymal interactions. Biochem. Biophys. Res. Commun. 405, 455-461
(2011).

28. Cai, J. et al. Patterning the size and number of tooth and its cusps. Dev. Biol. 304,
499-507 (2007).

29. Metscher, B. D. MicroCT for developmental biology: a versatile tool for high-
contrast 3D imaging at histological resolutions. Dev. Dyn. 238, 632-640 (2009).

30. Skinner, M. M. et al. Brief communication: contributions of enamel-dentine
junction shape and enamel deposition to primate molar crown complexity. Am. J.
Phys. Anthropol. 142, 157-163 (2010).


LETTER 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank I. Thesleff, P. Munne, A. R. Evans, I. Corfe, J. Moustakas,
M. Murtoniemi, I. Salazar-Ciudad, S. Sova, J.-P. Suuronen and S. Zohdy for discussions
or help; R. Santalahti, R. Savolainen and M. Makinen for technical assistance;
M. Hyvönen for the activin A protein; P. Schneider for the Fc-EDA-A1 protein; and
C. Tabin and A. Gritli-Linde for the ShhGFP mice. This study was funded by the Academy
of Finland, the Sigrid Juselius Foundation, the Finnish Cultural Foundation, and the
graduate school GSBM.


Author Contributions E.H. and J.J. designed the study. E.H. performed developmental
experiments and measurements. M.L.M. and M.V. designed and performed Eda;Edar
transgenic mouse experiments. K.H. and A.K. designed and performed
microtomography imaging. E.H. and J.J. analysed the data and wrote the manuscript
with contributions from the other authors. J.J. coordinated the study.


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to J.J. (jernvall@fastmail.fm). 


15 MARCH 2012 | VOL 483 | NATURE | 327 




METHODS 


Molar cultures. Lower first molar germs were dissected from heterozygous 
ShhGFP mouse embryos™ at day 13 post fertilization (day 13) and cultured at 
37 °C with 5% CO₂ using a Trowell-type organ culture as described previously*’.
Molar germs were separated from the jaw tissue, which, if left in place, would grow
and stunt the development of molars in culture. Medium was replaced every 2 days
and ascorbic acid was added (100 µg ml⁻¹, Sigma-Aldrich). The epifluorescence of
ShhGFP was used to select only molars that were at a similar stage of development
for treatments. Additionally ShhGFP mice were crossed with Eda-null mice (Stock 
000314, Jackson Laboratory). 

Molar treatments. We modified the activity of signalling pathways extracellularly 
because it allowed quantitative manipulations. EDA, activin A and SHH are 
examples of different pathways regulating tooth development, and their choice 
for the experiments was based on previous results on their effects on patterning, 
theoretical models, and the availability of suitable molecules to adjust signalling 
activity. All treatments were for the first 4 days, after which the first molar pattern- 
ing is mostly complete, and the second molar is partially complete. We note that 
our strategy of maximising cusp number with a single factor (the ligand) in each 
pathway could in principle be surpassed by also adjusting intracellular compo- 
nents, but these alterations would qualify as multiple changes in developmental 
regulation. 

The phenotypic effects of recombinant Fc-EDA-A1 (ref. 32), activin A, SHH
(catalogue no. 1314-SH, R&D Systems) proteins and cyclopamine (catalogue
no. C4116, Sigma) were tested using different concentrations. Dilutions were
made into 0.1% bovine serum albumin (BSA, Sigma) in PBS for the proteins
and into dimethylsulphoxide (DMSO) for cyclopamine. Tested EDA concentrations
were 0.5 and 0.75 µg ml−1 using wild-type teeth, and 0.01, 0.1, 0.4 and
1.0 µg ml−1 using Eda-null mutant teeth. Tested activin A concentrations were
0.1, 0.2, 0.5, 0.75, 3 and 5 µg ml−1. Tested SHH concentrations were 3 and 5 µg
ml−1 and cyclopamine concentrations were 0.5, 1, 2 and 4 µM. Only the 3 µg ml−1
concentration seems to be informative for SHH in tooth cultures because the
5 µg ml−1 concentration stopped development, and lower SHH concentrations
had no visible effects (E.H. and J.J., unpublished observations). Because high
dosages of recombinant proteins may compromise tissue growth, either directly
or through impurities, in the Fig. 2 experiments we used concentrations that
maximally increased cusps while minimally impeding growth. We note that we
cultured heterozygous ShhGFP molars because the ShhGFP construct is a Shh null
allele. Although previous studies have reported the development and morphology
of heterozygous ShhGFP mice to be normal, we tested the effects of SHH and
cyclopamine on wild-type molars and the results remained essentially the same
(Supplementary Fig. 2).

To examine the subtle effects of EDA signalling on crown morphology, Eda
and its receptor Edar were simultaneously overexpressed in transgenic mice under
the K14 promoter. To obtain compound K14-Eda; K14-Edar transgenic mice,
K14-Eda mice were crossed with K14-Edar mice and teeth of adult mice were
analysed using X-ray tomography (using 3 µm voxel size).

Quantitative analyses of experimental data. Cultures were photographed daily 
using both visible light (Olympus SZX9) and fluorescent (Leica MZFLIII) micro- 
scopy. On the last day of culturing, extra tissue surrounding the molar was removed 
before photography. Ten unpaired molars from at least three different litters were 
used for each treatment, thus a total of 80 teeth and 1,280 images were analysed for 
the cusp number tabulations. Developmental characteristics (cusp number, tooth 
size and cusp identity, Supplementary Table 3) were identified and measured from 
images using ImageJ 1.43 (http://rsbweb.nih.gov/ij/) and analysed using PAST 
(http://folk.uio.no/ohammer/past/index.html). Reliable identification of the two 
merged cusps forming the anteroconid of the first mouse molar is difficult in cul- 
tured teeth, and we therefore tabulated the anteroconid, when present, as one cusp. 

X-ray microtomography and OPC analyses. To examine cusp patterns of the
relatively large teeth starting to form hard tissues, we used X-ray tomography to
reconstruct three-dimensional cusp patterns. Molars were fixed with 4% para-
formaldehyde (PFA), dehydrated into 70% ethanol, and dyed with phosphotungstic
acid (catalogue no. P4006, Sigma) for 24 h. The samples were scanned using a
custom-built µCT system Nanotom 180 NF (phoenix|X-ray Systems + Services
GmbH) with a CMOS detector (Hamamatsu Photonics) and a high-power
transmission-type X-ray nanofocus source with a tungsten anode. The samples
were imaged with 80 kV acceleration voltage and 180 µA tube current. Projection
images were acquired over a full circle of rotation with 0.25° angular intervals, and
each projection image was composed of the average of 10 transmission images with
750 ms exposure time. The measurement geometry resulted in an effective voxel
size of 1 µm. The reconstruction from the projection images was performed with
software datos|x rec supplied by the system manufacturer. Mesenchyme was
segmented from epithelium after down-sampling (to 2 µm resolution) using
ImageJ (Supplementary Fig. 5). ImageJ 3D Viewer plugin was used to render the
epithelium–mesenchyme interface, which was cleaned (0.5 cluster decimation and
two steps of Laplacian smoothing) in MeshLab 1.3.0 (http://meshlab.sourceforge.
net/). From the digital elevation models (DEMs) we calculated OPC as previously
described. To obtain tooth OPC values comparable to previous studies, the DEMs
were down-sampled into a grid having 50 data rows along the tooth length.
Reconstructions of adult mouse teeth were visualised using Avizo Fire 
(Visualization Sciences Group). 
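
As a rough illustration of the kind of calculation behind an orientation patch count (OPC) on a DEM, the sketch below bins each grid cell by the compass direction of its downslope and counts connected patches of equal orientation. The eight direction bins, the minimum patch size of three cells, the 8-neighbour connectivity and all function names are assumptions made for this example; the procedure actually used is the one in the cited earlier work and may differ in detail.

```python
import math
from collections import deque


def orientation_bins(dem, n_bins=8):
    """Label each interior DEM cell with the direction bin of its downslope.

    Flat cells and border cells are left as None (an illustrative choice).
    """
    rows, cols = len(dem), len(dem[0])
    width = 2 * math.pi / n_bins
    bins = [[None] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            # Central-difference estimate of the surface gradient.
            dz_dx = (dem[r][c + 1] - dem[r][c - 1]) / 2.0
            dz_dy = (dem[r + 1][c] - dem[r - 1][c]) / 2.0
            if dz_dx == 0 and dz_dy == 0:
                continue
            aspect = math.atan2(-dz_dy, -dz_dx) % (2 * math.pi)
            # Bins are centred on the compass directions.
            bins[r][c] = int(((aspect + width / 2) % (2 * math.pi)) // width)
    return bins


def count_patches(bins, min_size=3):
    """Count connected patches of equal orientation with at least min_size cells."""
    rows, cols = len(bins), len(bins[0])
    seen = [[False] * cols for _ in range(rows)]
    patches = 0
    for r in range(rows):
        for c in range(cols):
            if bins[r][c] is None or seen[r][c]:
                continue
            label = bins[r][c]
            queue = deque([(r, c)])
            seen[r][c] = True
            size = 0
            while queue:  # breadth-first flood fill over 8 neighbours
                y, x = queue.popleft()
                size += 1
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and not seen[ny][nx] and bins[ny][nx] == label):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            if size >= min_size:
                patches += 1
    return patches


def opc(dem):
    """Orientation patch count of a DEM given as a list of rows."""
    return count_patches(orientation_bins(dem))
```

On a single tilted plane every cell faces the same way and the count is one; a simple ridge gives two patches, one per flank. More complex crowns, with more cusps and crests, yield higher counts.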
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Coevolution in multidimensional trait space favours 
escape from parasites and pathogens 


R. Tucker Gilman, Scott L. Nuismer & Dwueng-Chwuan Jhwueng


Almost all species are subject to continuous attack by parasites and 
pathogens. Because parasites and pathogens tend to have shorter 
generation times’” and often experience stronger selection due to 
interaction than their victims do*, it is frequently argued that they 
should evolve more rapidly and thus maintain an advantage in the 
evolutionary race between defence and counter-defence’*. This 
prediction generates an apparent paradox: how do victim species 
survive and even thrive in the face of a continuous onslaught of 
more rapidly evolving enemies’? One potential explanation is that 
defence is physiologically, mechanically or behaviourally easier than 
attack, so that evolution is less constrained for victims than for 
parasites or pathogens’. Another possible explanation is that 
parasites and pathogens have enemies themselves and that victim 
species persist because parasites and pathogens are regulated from 
the top down and thus generally have only modest demographic 
impacts on victim populations’*. Here we explore a third possibility: 
that victim species are not as evolutionarily impotent as conventional 
wisdom holds, but instead have unique evolutionary advantages 
that help to level the playing field. We use quantitative genetic 
analysis and individual-based simulations to show that victims 
can achieve such an advantage when coevolution involves multiple 
traits in both the host and the parasite. 

Most coevolutionary theory has focused on cases in which the inter- 
action rate between two species is governed by a single trait in each 
species’ (but see refs 10-12). In nature, however, species—species inter- 
action rates can depend on multiple traits'’. For example, the resistance 
of wild parsnip to webworm attack depends on flowering phenology 
and on the concentrations of at least two different furanocoumarins in 
the plant’. Similarly, many teleost fish use both mucosal barriers and 
biocidal secretions to defend against monogenean and copepod 
parasites, and parasites must overcome both mechanisms to infect 
the host successfully’*. Recent work has shown that the number of traits 
under selection can have qualitative effects on the outcome of evolution 
in single-species systems'®. Thus, it is reasonable to ask whether the 
number of traits may also affect the outcome of coevolution. 

To investigate the influence of the number of traits on coevolutionary 
trajectories, we developed a model of a victim—exploiter system in which 
the probability of a successful attack by an exploiter on a victim depends 
on n traits in each species. We assumed that each trait is functionally 
paired with a trait in the opposite species. When an exploiter encounters 
a potential victim, the probability of a successful attack depends on the 
suites of trait values in each individual. We call the probability of 
successful attack for a given encounter the ‘interaction probability’ to 
distinguish it from the interaction rate, which also depends on the 
densities of both species. The interaction probability may be highest 
when the exploiter’s trait value exceeds that of the victim (that is, 
difference traits) or when the individuals have similar trait values (that 
is, matching traits) at each functional pair. For example, the pericarp 
thickness of the Japanese camellia (Camellia japonica) and the rostrum 
length of its seed-parasitic weevil (Curculio camelliae) comprise a 
difference trait pair'’, whereas the flowering phenology of Heuchera 


grossulariifolia and the emergence time of its seed-parasitic moth 
Greya politella comprise a matching trait pair’*. Differences in inter- 
action probabilities between individuals with different trait values 
impose selection on traits and lead to coevolution between the species. 
We assumed that each trait affects the interaction probability indepen- 
dently. Thus, the victim can achieve a low interaction probability if it 
evolves at least one defence mechanism that avoids or overcomes the 
exploiter’s attack. Biologically, this corresponds to the common situation 
in which an attacker must overcome all of the chemical, morphological, 
physiological and behavioural defences of the victim for its attack to 
succeed. If the victim can evolve to lower its interaction probability
with its exploiter, we say the victim ‘wins’ the evolutionary contest. 

We used a quantitative genetic analysis of our model to extend
Kirkpatrick’s concept of maximum evolvability for a single species to
a maximum evolutionary escape rate for a victim–exploiter system
(Methods). When the maximum evolutionary escape rate is greater
than zero, the victim evolves to overcome or avoid the exploiter’s
attack, and so wins the evolutionary contest. Our analysis assumes
fixed genetic variance-covariance matrices (that is, G-matrices) in
each species and weak selection due to interspecific interactions. For a
very general class of interaction probability functions in the difference
trait case, the maximum evolutionary escape rate is

$$\delta_{d,\max} = \max\left[-(s_v G_v + s_e G_e)\,\xi_d\right], \qquad (1)$$

where $G_v$ and $G_e$ are the G-matrices of the victim and exploiter popu-
lations, $s_v$ and $s_e$ represent the effect of one interaction on the relative
fitness of a victim or an exploiter, and $\xi_d$ is a vector that captures the
strength of effect of traits 1 to n on the interaction probability. For a
very general class of interaction probability functions in the matching
trait case, the maximum evolutionary escape rate is

$$\delta_{m,\max} = \max\left(\mathrm{eig}\left[-(s_v G_v + s_e G_e)\,\xi_m\right]\right), \qquad (2)$$

where $\xi_m$ is a diagonal matrix in which the iith entry captures the
strength of effect of trait i on the interaction probability.

In representative systems where victim and exploiter G-matrices 
were randomly and independently generated, increasing the number 
of traits or the strength of correlations between traits increased the 
probability that the maximum evolutionary escape rate would be greater 
than zero, and thus that the victim would win the evolutionary contest 
(Fig. 1). Intuitively, this is true because victims need to overcome their 
exploiters at only one trait if they are to escape, whereas exploiters must 
overcome their victims at all traits if they are to succeed. Each additional 
trait that affects the interaction probability provides the victim with an 
additional opportunity to evolve an effective escape mechanism. 
Correlations between traits constrain the evolution of some traits but 
facilitate the evolution of others. When correlations are strong, it is more 
likely that the victim’s evolution will be strongly facilitated or the exploi- 
ter’s evolution will be strongly constrained in at least one trait, and thus 
that the victim will evolve to escape the exploiter at that trait. 
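
The intuition above can be explored numerically. The following is a minimal sketch, not the authors' procedure: it evaluates the difference-trait escape rate of equation (1) for randomly drawn G-matrices and reports how often it is positive. The way the random G-matrices are generated (a Gaussian matrix multiplied by its transpose), the equal trait weights, the trial count and all function names are assumptions for illustration; the default fitness effects follow the values quoted in the Figure 1 legend.

```python
import random


def random_g_matrix(n, rng):
    """Random positive semi-definite n x n matrix A * A^T (assumed generator)."""
    a = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(n)]
    return [[sum(a[i][k] * a[j][k] for k in range(n)) for j in range(n)]
            for i in range(n)]


def escape_rate_difference(g_v, g_e, s_v, s_e, xi):
    """Largest element of -(s_v G_v + s_e G_e) xi, as in equation (1)."""
    n = len(xi)
    rates = []
    for i in range(n):
        row = sum((s_v * g_v[i][j] + s_e * g_e[i][j]) * xi[j] for j in range(n))
        rates.append(-row)
    return max(rates)


def victim_win_fraction(n, trials=2000, s_v=-0.3, s_e=1.0, seed=0):
    """Fraction of random systems with a positive maximum escape rate."""
    rng = random.Random(seed)
    xi = [1.0] * n  # equal effect of every trait (assumption)
    wins = 0
    for _ in range(trials):
        g_v = random_g_matrix(n, rng)
        g_e = random_g_matrix(n, rng)
        if escape_rate_difference(g_v, g_e, s_v, s_e, xi) > 0:
            wins += 1
    return wins / trials


if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        print(n, victim_win_fraction(n))
```

Because the victim wins if any single element of the vector is positive, adding traits gives it more chances, which is the effect described in the text. The exact fractions depend on the assumed distribution of G-matrices and should not be read as reproducing Fig. 1.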

Figure 1 | Percentage victim wins as a function of the number of traits n and
the mean magnitude of correlations between traits. a, The difference trait
case. b, The matching trait case. In all simulations $s_e = 1$ and $s_v = -0.3$. In the
difference trait case $\xi_{d,i} = \xi_{d,j}$ and in the matching trait case $\xi_{m,ii} = \xi_{m,jj}$ for all [i, j].

Our quantitative genetic analysis assumes that selection is weak and
G-matrices are constant, but in nature selection due to interspecific
interactions can be strong and G-matrices can evolve. We used
genetically explicit individual-based simulations*** to test the robust- 
ness of our analytical results when the assumptions of our quantitative 
genetic analysis were violated (Methods). We initially assumed that the 
sizes of the victim and exploiter populations were independent of 
the interspecific interaction rate. This might be true, for example, if 
the exploiter has alternative victim species and if the victim population 
is regulated by something other than exploitation (for example, 
resource competition). As predicted, victims were more likely to evolve 
to lower their interaction probabilities with their exploiters when the 
number of traits governing the interaction probability was large and 
when the correlations between traits were strong (Tables 1 and 2). This 
result holds whether interaction probabilities are governed by differ- 
ence traits, matching traits, or combinations of difference and match- 
ing traits (Methods and Supplementary Table 1). 

Table 1 | Outcome of coevolution in the difference trait case

n    r_p = 0.00      r_p = 0.08     r_p = 0.26     r_p = 1.00
1    0.7222* (17*)   -              -              -
2    0.6703 (17)     0.6295 (18)    0.5963 (32)    0.4621 (39)
4    0.5633 (19)     0.5267 (31)    0.3304 (63)    0.1745 (77)
8    0.4639 (26)     0.4024 (34)    0.1789 (82)    0.0336 (96)

We conducted 100 simulations for each combination of trait number n and correlation strength r_p. We
recorded a victim win if the mean interaction probability decreased over 2,000 generations of
interaction. Results are shown as mean final interaction probability (percentage victim wins). The
parameter r_p is the mean magnitude of correlations between pairs of traits in the same species at the
beginning of each simulation. Biologically, r_p can be interpreted as the expected magnitude of
correlations between traits due to pleiotropy alone. The magnitude and direction of correlations can
change as G-matrices evolve (see Methods), and we measured r_p at the beginning of each simulation to
capture correlation strength before selection had altered G-matrices.

* Correlation is not meaningful when n = 1.

Figure 2 | Interaction probabilities and population densities in
representative numerical simulations. Beaded lines show the mean
interaction probability in each generation. Black lines show the victim density,
and grey lines the exploiter density, with densities scaled to the carrying
capacity of the victim. a, b, The difference trait case for n = 1
(a; $\delta_{d,\max} = -0.012$) and n = 4 (b; $\delta_{d,\max} = 0.046$). c, d, The matching trait case
for n = 1 (c; $\delta_{m,\max} = -0.029$) and n = 4 (d; $\delta_{m,\max} = 0.069$). Maximum
evolutionary escape rates were calculated in the first pictured generation of each
simulation, but changed as G-matrices evolved.

The mechanisms that promote evolutionary escape in the quantitative
genetic analysis also operate in numerical simulations. However,
numerical simulations revealed an additional mechanism: G-matrices
in the victim and exploiter populations evolve to favour the victim
species increasingly as the number of traits in the system increases. In
the difference trait case, this happens because antagonistic interspecific 
interactions cause negative correlations to evolve between the victim’s 
defensive traits. These negative correlations constrain victim evolution. 
As the number of traits in the system increases, these negative correla- 
tions dissipate, and the constraint on victim evolution is removed 
(Methods and Supplementary Table 5). In the matching trait case, 
antagonistic interactions cause increased variance in the traits of the 
victim population and decreased variance in the traits of the exploiter 
population’®”®. Greater genetic variance in the victim population allows 
the victim to evolve more rapidly relative to the exploiter. As the 
number of traits in the system increases, the genetic variance in the 
victim population increases relative to that in the exploiter population 
(Methods and Supplementary Table 6). As a result, the victim can 
evolve away from the exploiter more rapidly when the number of traits 
in the system is large. 

In nature, victim and exploiter population sizes can be sensitive to 
interaction rates, and therefore coevolution can affect population 
dynamics. To test whether our qualitative results hold in such cases, 
we performed additional simulations in which the absolute fitnesses of 
individuals in each population depended on the number of interac- 
tions in which those individuals participated (Methods). These simu- 
lations showed two qualitatively different ecological outcomes (Fig. 2). 
In the first, the exploiter evolves faster than the victim in all traits. The 
interaction probability approaches one, and the exploiter wins the 
evolutionary contest (Fig. 2a, c). In the second, the victim evolves faster 
than the exploiter in at least one trait. The interaction probability 
declines and the exploiter population shrinks. As exploiters become 
less common, selection for increased victim defence is reduced, and the
rate of victim evolution slows. As a result, the exploiter may be able to
persist with a low interaction probability and a low population density
(Fig. 2b, d). If the populations are sufficiently small, the chance fixation
of favourable mutations in the victim or unfavourable mutations in the
exploiter can reduce the interaction probability until the exploiter
cannot achieve a positive population growth rate. If this happens,
the exploiter population crashes and there is little opportunity for
new favourable mutations to rescue the rapidly shrinking exploiter
population from extinction. In the difference trait, matching trait
and mixed trait cases, the probability that the victim evolves to lower
its interaction probability with its exploiter, and the probability that
the exploiter goes extinct, increase as the number of traits or the
strength of the correlations between traits increases (Methods and
Supplementary Tables 2–4). This supports the results of our fixed-
population analyses.

Table 2 | Outcome of coevolution in the matching trait case

n    r_p = 0.00      r_p = 0.08     r_p = 0.26     r_p = 1.00
1    0.7447* (22*)   -              -              -
2    0.6700 (22)     0.6602 (25)    0.5724 (37)    0.2980 (59)
4    0.5262 (26)     0.4633 (33)    0.2373 (76)    0.0583 (92)
8    0.3239 (43)     0.2425 (55)    0.0399 (98)    0.0074 (98)

We conducted 100 simulations for each combination of trait number n and correlation strength r_p. We
recorded a victim win if the mean interaction probability decreased over 2,000 generations of
interaction. Results are shown as mean final interaction probability (percentage victim wins).

* Correlation is not meaningful when n = 1.

Our study is motivated by host—parasite systems, but our models are 
general to any victim-exploiter species pair. For example, in nature 
predators must first detect, then capture, and finally subdue their 
victims, and victims have defensive traits to thwart their predators at 
each stage of the attack’’. The presence of multiple defence mechan- 
isms may help prey species to evolve and maintain low interaction 
rates with their predators. Similarly, herbivores often experience 
meaningful loss of fitness as a result of the various defence mechanisms 
of their host plants’”**. Coevolution in highly dimensional trait space 
may help plants to limit the damage they receive from herbivores, and 
so may help to explain why the world is green”. 


METHODS SUMMARY 


We modelled a victim-exploiter system in which selection is due only to inter- 
specific interactions. Exploiters encounter potential victims at random, and they 
interact with a probability that depends on a suite of n genetically determined traits 
in each individual. 

Quantitative genetic analysis. Under appropriate assumptions (Methods), the
change in population mean trait values during one generation is

$$\Delta\bar{z}_e = G_e\,\beta_e, \qquad \Delta\bar{z}_v = G_v\,\beta_v, \qquad (3)$$

where each $\beta$ is a vector of selection gradients that depends on the interaction
probability function, the fitness effect of interactions and the trait values in each
population. For difference trait systems, we determined whether the victim wins
by analysing the conditions under which $\Delta\bar{z}_{v,i} > \Delta\bar{z}_{e,i}$ for at least one i. For match-
ing trait systems, (3) has an equilibrium at $\bar{z}_e = \bar{z}_v$. We determined whether the
victim wins by analysing the conditions under which the maximum eigenvalue of
the Jacobian of (3) evaluated at $\bar{z}_e = \bar{z}_v$ is greater than zero.

Numerical simulations. We complemented our analytical approach with gen-
etically explicit individual-based simulations. Trait values in either species were
governed by a set of unlinked, diallelic loci with pleiotropic alleles. Pleiotropy and
linkage disequilibria caused correlations between trait values in each population.
We considered two well-studied heuristic interaction probability functions. An
exploiter with trait values $(z_{e1}, z_{e2}, \ldots, z_{en})$ interacts with a victim with trait values
$(z_{v1}, z_{v2}, \ldots, z_{vn})$ with probability

$$P(I \mid z_{e1}, z_{e2}, \ldots, z_{en}, z_{v1}, z_{v2}, \ldots, z_{vn}) = \prod_{i=1}^{n} \frac{1}{1 + \exp\left(-g_i (z_{ei} - z_{vi})\right)} \qquad (4)$$

in the difference trait case and

$$P(I \mid z_{e1}, z_{e2}, \ldots, z_{en}, z_{v1}, z_{v2}, \ldots, z_{vn}) = \prod_{i=1}^{n} \exp\left(-\gamma_i (z_{ei} - z_{vi})^2\right) \qquad (5)$$

in the matching trait case. Parameters $g_i$ and $\gamma_i$ determine the sensitivity of the
interaction probability to trait i. Reproduction is sexual, and individuals produce
offspring in proportion to their fitnesses after interactions. This framework allows
trait means, variances and covariances to evolve. We iterated generations and
recorded whether the interaction probability between exploiters and victims
increased (that is, the exploiter won) or decreased (that is, the victim won) over time.
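
The two interaction probability functions can be written out directly. The sketch below is an illustrative reading of them as a product over traits of a logistic term (difference traits) or a Gaussian term (matching traits); the function and argument names are hypothetical and the trait values in the usage note are arbitrary.

```python
import math


def p_difference(z_e, z_v, g):
    """Difference traits: each factor rises as the exploiter's trait exceeds the victim's."""
    p = 1.0
    for ze, zv, gi in zip(z_e, z_v, g):
        p *= 1.0 / (1.0 + math.exp(-gi * (ze - zv)))
    return p


def p_matching(z_e, z_v, gamma):
    """Matching traits: each factor is largest when the two trait values coincide."""
    p = 1.0
    for ze, zv, gi in zip(z_e, z_v, gamma):
        p *= math.exp(-gi * (ze - zv) ** 2)
    return p
```

Because the per-trait terms multiply, a single strongly mismatched trait is enough to drive the whole probability towards zero. For example, `p_difference([0, 0], [0, 10], [1, 1])` is tiny even though the first trait pair is evenly matched, which is why one effective defence suffices for the victim.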
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Corticostriatal plasticity is necessary for learning 
intentional neuroprosthetic skills 


Aaron C. Koralek, Xin Jin, John D. Long II, Rui M. Costa & Jose M. Carmena


The ability to learn new skills and perfect them with practice 
applies not only to physical skills but also to abstract skills’, like 
motor planning or neuroprosthetic actions. Although plasticity in 
corticostriatal circuits has been implicated in learning physical 
skills**, it remains unclear if similar circuits or processes are 
required for abstract skill learning. Here we use a novel behavioural 
task in rodents to investigate the role of corticostriatal plasticity in 
abstract skill learning. Rodents learned to control the pitch of an 
auditory cursor to reach one of two targets by modulating activity in 
primary motor cortex irrespective of physical movement. Degrada- 
tion of the relation between action and outcome, as well as sensory- 
specific devaluation and omission tests, demonstrate that these 
learned neuroprosthetic actions are intentional and goal-directed, 
rather than habitual. Striatal neurons change their activity with 
learning, with more neurons modulating their activity in relation 
to target-reaching as learning progresses. Concomitantly, strong 
relations between the activity of neurons in motor cortex and the 
striatum emerge. Specific deletion of striatal NMDA receptors 
impairs the development of this corticostriatal plasticity, and 
disrupts the ability to learn neuroprosthetic skills. These results 
suggest that corticostriatal plasticity is necessary for abstract skill 
learning, and that neuroprosthetic movements capitalize on the 
neural circuitry involved in natural motor learning. 

The ability to learn new actions and perfect them with practice 
allows us to master skills like playing the piano or riding a bicycle. 
Learning these skills usually implies moving faster, more accurately 
and less variably*. However, mastering other types of skills, like playing 
board games or controlling neuroprosthetic devices, often does not 
directly involve changes in physical movement'®. Cortico-basal ganglia 
circuits have been implicated in the learning, selection and execution of 
physical skills**”*. In particular, plasticity in the motor cortices and the 
striatum, the major input region of the basal ganglia, has been shown to 
accompany the learning of physical skills*®. The motor cortex and 
frontal cortices have also been implicated in the learning of abstract 
skills'®’, and in learning to control neuroprosthetic devices irrespec- 
tive of physical movement’*””. Some studies suggest that not only 
cortical areas, but also the striatum, are involved in learning abstract 
skills'* °°. However, it is still unclear if the striatum is required for 
abstract skill learning, and if corticostriatal circuits undergo plasticity 
during the learning of such skills as they do during the learning of 
physical skills. Here, we use a novel behavioural task in conjunction 
with electrophysiology and genetic manipulation in rodents to investi- 
gate the role of corticostriatal circuits and corticostriatal plasticity in 
the learning of intentional neuroprosthetic actions: that is, actions per- 
formed with disembodied actuators based on the modulation of specific 
neural activity and irrespective of physical movement’. 

Figure 1 | Volitional modulation of M1 neural activity in awake behaving
rodents. a, Task schematic. M1 unit activity was entered into an online
transform algorithm that related ensemble activity to the pitch of an auditory
cursor. Two opposing ensembles were chosen, with activity of one ensemble
increasing the cursor pitch and activity of the other ensemble decreasing the
cursor pitch. Constant auditory feedback about cursor location was supplied to
rodents, and distinct rewards were supplied when rodents brought M1 activity
into one of two target states. b, Mean M1 ensemble firing rates for units in
ensemble 1 (green), ensemble 2 (blue) and M1 units not used in the transform
(black) in relation to the achievement of target 1 (top) or target 2 (bottom).
Representative waveforms recorded from M1 in rats are shown on the right,
with shaded regions denoting the standard deviation.

We developed a novel operant brain-machine interface task in
which rodents were required to modulate activity in M1, rather than
execute a physical movement, to obtain reward (Fig. 1a). Modulation
of M1 ensemble activity resulted in changes in the pitch of an auditory
cursor, which provided constant auditory feedback to rodents about
task performance. Reward was delivered when rodents precisely
modulated M1 activity to move this auditory cursor to one of two
target tones, and a trial was marked incorrect if no target had been hit
within a set time limit (30 s). One of these targets was associated with a 
reward of sucrose solution, whereas the other target was associated 
with a pellet reward (see Methods). Two neural ensembles consisting
of two to four well-isolated units each were used to control the
auditory cursor (Supplementary Figs 1 and 2). The action of these 
two ensembles opposed each other, such that increased activity in 
one ensemble produced increases in cursor pitch, whereas increased 
activity in the other ensemble caused decreases in cursor pitch. Thus, 
to achieve a high-pitched target, rodents had to increase activity in the 
first ensemble and decrease activity in the second; the opposite was 
required to hit a low-pitched target (Fig. 1b and Supplementary Fig. 3). 
These firing-rate modulations had to be maintained for several time 
bins (200 ms bin size) for a target to be hit (Supplementary Methods). 
Hence, in this operant task, rodents had to bring the two M1 ensembles 
into a desired state irrespective of motor output. 
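
The trial logic described above can be sketched in a few lines. This is a simplified illustration and not the transform actually used (which is given in the Supplementary Methods): the cursor is taken here as the summed spike count of one ensemble minus that of the other in each 200 ms bin, and a target counts as hit when the cursor stays beyond a threshold for several consecutive bins. The threshold, the number of bins to hold and all names are hypothetical; only the 200 ms bin size and the 30 s time limit come from the text.

```python
def run_trial(ens1_counts, ens2_counts, threshold=5, hold_bins=3,
              bin_ms=200, limit_ms=30000):
    """Return (outcome, time_ms) where outcome is 'high', 'low' or 'timeout'.

    ens1_counts / ens2_counts: one list of per-unit spike counts per time bin.
    """
    max_bins = limit_ms // bin_ms
    run_high = 0  # consecutive bins with the cursor above +threshold
    run_low = 0   # consecutive bins with the cursor below -threshold
    for i, (a, b) in enumerate(zip(ens1_counts, ens2_counts)):
        if i >= max_bins:
            break
        # Opposing ensembles: ensemble 1 raises the pitch, ensemble 2 lowers it.
        cursor = sum(a) - sum(b)
        run_high = run_high + 1 if cursor >= threshold else 0
        run_low = run_low + 1 if cursor <= -threshold else 0
        if run_high >= hold_bins:
            return "high", (i + 1) * bin_ms
        if run_low >= hold_bins:
            return "low", (i + 1) * bin_ms
    return "timeout", limit_ms
```

Requiring the modulation to persist over several bins means that a brief chance fluctuation in firing does not count as reaching a target.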

We trained six male Long-Evans rats on the task, and verified that 
they exhibited marked improvement in the percentage of correct trials 
over the course of 11 days (Fig. 2a). As typically observed in motor skill
learning, there was a phase of rapid improvement followed by a phase
of slower learning, representing early (days 2–4) and late (days 8–11)
phases of learning. The percentage of correct trials increased signifi-
cantly from early to late in learning (Fig. 2b; P < 0.001), resulting in
performance well above chance (Fig. 2c; P < 0.001; see Supplementary
Methods), whereas mean time-to-target decreased (Supplementary 
Fig. 4). Analyses of M1 firing rates further showed that rats were 
producing the desired neuronal ensemble rate modulations during 
task performance (Fig. 1b and Supplementary Fig. 3). Furthermore, 
sensory feedback was found to be critical for animals to learn this task 
because when rats were not given auditory feedback during training 
(although they would still get a reward if they would modulate neural 
activity correctly), the percentage of correct trials did not increase over 
the course of 9 days of training (Supplementary Fig. 5). 

We next investigated if animals were performing physical movements
that would modulate the activity of those particular M1 ensembles. First, 
we monitored overall rodent movement with an accelerometer 
mounted on the recording headstage, which allowed us to measure if 
the animals produced any body or head movement during target
achievement. Accelerometer traces exhibited no changes before
and during target reaching, but did show prominent deflections after 
target reaching as the animals retrieved the reward (Fig. 2d), demon- 
strating that rodents were not relying on gross motor behaviour to 
perform the task. We also monitored movements of the vibrissae with 
electromyographic (EMG) recordings of the mystacial pad (electrodes 
targeted M1 areas controlling vibrissae movement; Supplementary 
Fig. 2 and Supplementary Methods), and observed no significant 
EMG signals before target achievement, although there were clear 
EMG signals afterwards as animals retrieved and consumed the reward 
(Fig. 2e and Supplementary Fig. 6b). Importantly, there was no 
correlation between EMG activity and the spiking of the M1 neurons 
controlling the auditory cursor: the correlation coefficient for all trials 
in a behavioural session was 0.092 ± 0.003 (mean ± s.e.m.), and the
distribution of correlation coefficients across a session was not signifi- 
cantly different from zero (Supplementary Fig. 6a; P = 0.57). This was 
observed across all training days, including during early learning 
(Supplementary Fig. 7). These data suggest that rats do not rely on 
physical movements to learn the task, although it is difficult to exclude 
the possibility that animals use some movement to generate neural 
activity to drive the auditory cursor during exploratory phases of 
the task in early learning. Nonetheless, the data show that animals 
eventually learn to perform the task in the absence of overt movement. 
To demonstrate further that rats did not require vibrissae movements 
to control M1 activity, we injected lidocaine into the whisker pad to 
inactivate sensory and motor nerve endings locally during a session in 
late learning (see Supplementary Methods). There was no significant 
change in performance during the temporary inactivation (Fig. 2f; 
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Figure 2 | Learning intentional neuroprosthetic actions independently of 
movement. a, Mean percentage of correct responses for all rats across days 
1-11 of learning. Shaded regions denote the range of days from which the early 
and late learning analyses were performed. b, The percentage of correct 
responses for all rats increased significantly from early (light blue) to late (dark 
blue) in learning. c, Percentage of correct responses in late learning is 
significantly greater than expected by chance. d, Representative accelerometer 
traces show no gross motor behaviour leading to target achievement (time 
zero), but clear deflections as animals initiate movement to retrieve reward. 
e, Representative EMG traces show no muscle activity in the mystacial pad 
before target achievement, but clear deflections as animals retrieve and 
consume reward. f, Mean performance in all rats when lidocaine was injected 
into the whisker pad before a behavioural session late in learning (red) 
compared with performance during a no-lidocaine session (dark blue). 

g, Significant reduction in response rate when the causal relation between target 
achievement and reward delivery was degraded (dark blue). When contingency 
was reinstated, performance rapidly returned to pre-degradation levels (red). 
h, Percentage of total correct trials that were directed at the target associated 
with pellet reward (blue) or sucrose solution reward (red) during choice 
sessions where rats had free access to pellets (left; 'Pellets devalued'), or to
sucrose before the session (right; 'Sucrose devalued'). i, Percentage of total trials
that involved responses towards target 1 (blue), target 2 (red) or response 
omissions (black) when omission tests were performed for target 1 (left) or 
target 2 (right). 


P > 0.9), with rats achieving 78.1 ± 2.2% correct with lidocaine
(mean ± s.e.m.) versus 78.8 ± 6.5% without lidocaine. Taken together,
these data indicated that rodents were able to learn to control M1 
activity operantly irrespective of any overt movement. 
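
The check that EMG activity and M1 spiking are uncorrelated can be illustrated with a short sketch. It assumes that per-trial EMG amplitudes and M1 spike counts are already available as plain lists. The function names and the sign-flip permutation test are illustrative choices, not the analysis reported in the Supplementary Methods.

```python
import math
import random


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def mean_and_sem(values):
    """Sample mean and standard error of the mean (n - 1 denominator)."""
    n = len(values)
    m = sum(values) / n
    var = sum((v - m) ** 2 for v in values) / (n - 1)
    return m, math.sqrt(var / n)


def sign_flip_p_value(values, n_perm=10000, seed=0):
    """Two-sided sign-flip permutation test of the hypothesis mean == 0.

    Assumption: under the null, each coefficient is equally likely to be
    positive or negative, so random sign flips give the null distribution.
    """
    rng = random.Random(seed)
    observed = abs(sum(values) / len(values))
    hits = 0
    for _ in range(n_perm):
        flipped = [v if rng.random() < 0.5 else -v for v in values]
        if abs(sum(flipped) / len(flipped)) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

In use, `pearson_r` would be applied to each trial, `mean_and_sem` would summarize the resulting coefficients, and a large value from `sign_flip_p_value` would indicate that their distribution is not distinguishable from zero.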

Goal-directed actions are sensitive to changes in the relation between 
performing the action and obtaining a reward (contingency) and to 
changes in the expected value of the reward, whereas habits are 
not. We asked if these neuroprosthetic actions were performed
intentionally, because the animal volitionally controlled M1 activity 
to get the outcome (goal-directed), or habitually owing to the reinforce- 
ment history. To test this, we first degraded the contingency between 
executing the action and obtaining the outcome: that is, the auditory 
cursor was still under control of M1 ensemble activity, but the 
probability of obtaining reward was similar irrespective of target 
achievement, which had no effect on the rate of reward. After 2 days 
of contingency degradation, rats markedly diminished their respond- 
ing and the percentage of correct trials decreased significantly (Fig. 2g; 
P < 0.001). When contingency was reinstated, rats resumed responding
and the percentage of correct trials returned to plateau levels seen in
late learning (Fig. 2g). 

To investigate further the intentional nature of these neuro- 
prosthetic skills, we performed a test where each of the outcomes 
was devalued using sensory-specific satiety. Rats were given free access 
either to sucrose solution or pellets for 1 h before the behavioural
session, thereby reducing the expected value of that outcome. After
specific devaluation of each outcome/reward, rats chose the target 
leading to that reward much less than the target leading to the reward 
that was not devalued (Fig. 2h; P< 0.001), indicating that their actions 
were sensitive to changes in outcome value. Importantly, there were no 
significant differences in reward preference during normal task per- 
formance when neither of the outcomes was devalued (Supplemen- 
tary Fig. 8; P> 0.25). Finally, we asked whether rats were able to 
intentionally inhibit the reaching of one of the two targets to obtain 
the specific reward associated with that target. To examine this we 
performed an omission test, where the reward previously associated with 
reaching a particular target was only delivered when rats successfully 
inhibited reaching that target throughout the duration of the trial. If the
target was reached during the 30 s of trial duration, no reward was
delivered and a new trial was initiated. Importantly, reaching the other 
target continued to lead to reward as during training. Animals behaved 
in a goal-directed manner in the omission test for both targets, because
they reduced the number of reaches for the target they had to omit 
versus the no-omission target, while increasing the number of correctly 
omitted responses (Fig. 2i; P< 0.001 for both comparisons). Taken 
together, these data show that the neuroprosthetic actions in our task 
are sensitive to changes in the causal relation between performing 
the action and obtaining the reward (contingency degradation and 
omission test), and to changes in the expected value of the outcome 
(sensory-specific devaluation), indicating that they are intentional and 
goal-directed rather than habitual. 

We next examined if learning to operantly control M1 activity 
irrespective of overt movement involves striatal plasticity, akin to what 
is observed for natural motor learning. We verified that the
improvement in behavioural performance seen across learning was 
accompanied by a significant increase in firing rates in the dorsal 
striatum (DS) in late learning compared with early learning (Fig. 3a; 
P<0.001). In addition to this general increase in firing rates, we 
noticed that firing rates of DS neurons exhibited greatest modulation 
during target reaching compared with baseline control periods 
(Fig. 3b), as observed during natural motor learning. This modulation
was significantly greater in late learning than early learning
(Fig. 3b, c; P< 0.05), indicating that DS neurons changed their activity 
during the volitional control of M1 activity, and that this change 
increased with learning. 

We next investigated if learning, and the observed changes in DS 
target-related activity, were accompanied by corticostriatal plasticity, 
that is, changes in the functional interactions between M1 and DS 
neurons. We noticed that cross-correlation histograms between the 
two regions in late learning exhibited pronounced oscillatory spike 
coupling (Fig. 3d). To quantify this interaction, we calculated the 
coherence between spiking activity in the two regions in both early 
and late learning (Supplementary Methods). The resulting coherograms 
exhibited a clear increase in coherence at low-frequency bands in late 
learning relative to early learning (Fig. 3e), and these frequencies cor- 
responded to the oscillatory frequency seen in the cross-correlograms 
(Supplementary Methods). Furthermore, mean coherence in the theta 
band (4-8 Hz) was significantly greater in late learning than early 
learning (Fig. 3f; P< 0.001). This increase in coherence appeared to 
be related to learning to perform the task rather than higher reward 
expectation or proportion of correct trials in late learning, because 
coherence values remained high surrounding target achievement 
during the contingency degradation manipulation, where reward 
delivery was not contingent upon target achievement (Supplementary 
Figure 3 | Learning abstract skills is accompanied by corticostriatal
plasticity. a, Mean normalized firing rates in DS increased significantly from 
early (light blue) to late (dark blue) learning. Representative waveforms 
recorded from the DS are shown on the right (shaded regions denote standard 
deviation). b, Z-scored firing rates for individual DS units in relation to target 
achievement (time zero) showing marked modulation before target 
achievement in late learning. c, The percentage of DS units exhibiting target- 
related firing-rate modulation increased significantly with learning. d, Cross- 
correlation histograms in late learning for M1 spiking activity in relation to DS 
spikes (left), and DS spiking activity in relation to M1 spikes (right), showing 
oscillatory coupling between the two regions. e, Coherence between M1 spikes 
and DS spikes in early (left) and late (right) learning shows a clear increase in 
low-frequency coherence from early to late learning. f, Significant increase in 
mean coherence in the theta range in late (dark blue) versus early (light blue) 
learning. Shaded regions denote s.e.m. 


Fig. 9; not significantly less than non-degraded trials, P > 0.05). In 
addition, coherence levels remained high during task performance in 
incorrect trials (data not shown), further suggesting that the increase 
in coherence observed is due to learning to perform the skill rather 
than outcome anticipation. Thus, neuroprosthetic skill learning is 
accompanied by dynamic changes in functional interactions between 
M1 and the DS neurons, suggesting an important role for corticostriatal 
plasticity in this novel task. 
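
The coherence measure used above can be sketched as follows. This is a minimal illustration that assumes spike trains are binned into counts and split into non-overlapping segments; the bin width, segment length, function names and 1 Hz frequency step are hypothetical, and the authors' actual estimator is described only in their Supplementary Methods.

```python
import cmath
import math


def bin_spikes(spike_times, duration, bin_width):
    """Convert spike times (in seconds) into a list of per-bin spike counts."""
    n_bins = int(round(duration / bin_width))
    counts = [0] * n_bins
    for t in spike_times:
        i = int(t / bin_width)
        if 0 <= i < n_bins:
            counts[i] += 1
    return counts


def _dft_at(segment, freq, fs):
    """Fourier coefficient of a mean-subtracted segment at one frequency."""
    m = sum(segment) / len(segment)
    return sum((v - m) * cmath.exp(-2j * math.pi * freq * k / fs)
               for k, v in enumerate(segment))


def coherence(x, y, fs, freq, seg_len):
    """Magnitude-squared coherence at `freq`, averaged over segments.

    With a single segment the estimate is always 1, so several segments
    are needed for the value to be informative.
    """
    n_seg = min(len(x), len(y)) // seg_len
    sxx = syy = 0.0
    sxy = 0j
    for s in range(n_seg):
        a = _dft_at(x[s * seg_len:(s + 1) * seg_len], freq, fs)
        b = _dft_at(y[s * seg_len:(s + 1) * seg_len], freq, fs)
        sxx += abs(a) ** 2
        syy += abs(b) ** 2
        sxy += a * b.conjugate()
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) ** 2 / (sxx * syy)


def mean_theta_coherence(x, y, fs, seg_len, band=(4.0, 8.0), step=1.0):
    """Average coherence over the theta band (4-8 Hz by default)."""
    freqs = []
    f = band[0]
    while f <= band[1] + 1e-9:
        freqs.append(f)
        f += step
    return sum(coherence(x, y, fs, f, seg_len) for f in freqs) / len(freqs)
```

Two trains that keep a fixed phase relation across segments give a value near 1, whereas trains whose relative phase changes from segment to segment give a value near 0, which is why an increase in this quantity is read as stronger functional coupling.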

We therefore investigated if corticostriatal plasticity would be 
necessary for neuroprosthetic skill learning. NMDARs (N-methyl-D-aspartic
acid receptors) in striatal medium spiny neurons are critical
for corticostriatal long-term potentiation. We used a knock-in line
that expresses Cre recombinase in both striatonigral and striatopallidal 
medium spiny neurons (RGS9L-cre), but not in all striatal neurons 
(for example, absent from parvalbumin interneurons; Supplemen- 
tary Fig. 10), and crossed it with mice carrying a floxed allele of the 
NMDAR1 gene. The resulting mice lack NMDA currents in most
projection neurons (but not all striatal cells, hence we refer to them as
RGS9L-Cre/Nr1f/f, not as striatal NR1 knockouts: see Supplementary
Methods), and have impaired corticostriatal long-term potentiation.
As previously described, these animals do not display any major motor 
deficits (Supplementary Videos 1 and 2) and can learn to perform 
rapid sequential movements (Supplementary Fig. 11), albeit being 
unable to learn precise motor sequences. We investigated neuroprosthetic
skill learning in RGS9L-Cre/Nr1f/f mice and littermate
controls. Although control mice showed performance improvement 
across learning irrespective of physical movement as observed for rats 
(P < 0.001; Fig. 4a, b), RGS9L-Cre/Nr1f/f mice exhibited marked learning
deficits on the task, with no significant increase in the percentage
of correct trials from early to late learning (Fig. 4a; P= 0.98). 
Furthermore, acute pharmacological blockade of NMDARs in trained 
Figure 4 | Selective deletion of NMDARs in the striatum impairs brain-machine
interface learning. a, RGS9L-Cre/Nr1f/f mice (red) exhibit no
significant increase in the percentage of correct trials over the course of 
learning, despite clear performance improvement in littermate controls (blue). 
CT, controls; MT, mutants. b, Accelerometer traces from control mice showing 
no clear oscillation before target achievement, but clear deflections as mice 
retrieve reward. c, Late firing rate normalized to early firing rate in controls and 
mutants. There is no significant increase in DS firing rates in RGS9L-Cre/Nr1f/f
mice (red) from early to late learning, although DS firing rates increase 
markedly in control mice (blue). d, DS units of controls exhibit strong target- 
related firing-rate modulations, including both excitation (left) and inhibition 
(right). e, The percentage of DS units showing significant target-related firing- 
rate modulations increases significantly across learning in control mice (blue), 
but not in RGS9L-Cre/Nr1f/f mutants (red). f, Coherograms showing coherence
between M1 spikes and DS spikes in early (left) and late (right) learning both for
control mice (top) and RGS9L-Cre/Nr1f/f mice (bottom). Coherence in low-frequency
bands increases from early to late learning in control mice, but not in
RGS9L-Cre/Nr1f/f mice. g, Mean coherence in the theta range for control (top)
and mutant (bottom) mice. There is a significant increase in coherence from 
early (light blue) to late (dark blue) learning in control mice but not in mutant 
mice (early learning, light red; late learning, dark red). 


control animals did not affect performance of the neuroprosthetic skill 
(even at relatively high doses that affect striatal burst firing; Sup- 
plementary Figs 12 and 13), suggesting that the deficits observed in 
RGS9L-Cre/Nr1f/f mice are not due to inability to perform the skill but
rather to the inability to learn the task. 

Consistent with the findings above in rats (Fig. 3a), DS neurons in 
littermate control mice exhibited a significant increase in firing rate 
across learning, whereas in mutants they did not (Fig. 4c; main effect of 
genotype Fi 19 = 32.45, P<0.001; early versus late P<0.05 for 
controls (CT) and P = 0.23 for mutants (MT)). Also, in control mice, 
the proportion of DS neurons with significant target-related firing-rate 
modulation increased with learning (Fig. 4d, e; P< 0.05), but this was 
not observed in RGS9L-Cre/Nr1f/f mice (Fig. 4e; P = 0.28). Finally, the
development of functional corticostriatal interactions during learning 
was also abolished in RGS9L-Cre/Nr1f/f mutants, with no significant
increase in coherence between M1 and DS spikes with learning (Fig. 4f, 
, Fg0,10 = 0.65, P = 0.44), although littermate controls showed a clear 
increase as seen in rats (Fig. 4f, g; Fgo19 = 4.86, P<0.05). Taken 
together, these results demonstrate that the striking corticostriatal 
plasticity observed in rats during learning also occurs in control mice, 
but this plasticity is absent in mice with a decrease in functional
NMDARs in striatum. These mutant mice do not show improvement 
with training, therefore indicating that corticostriatal plasticity may be 
necessary for learning to modulate M1 states intentionally to obtain 
specific outcomes. 

In summary, we used a novel operant task in rodents to demonstrate 
that corticostriatal networks exhibit profound plasticity during the 
learning of intentional neuroprosthetic skills and, further, that disrupt- 
ing this plasticity impairs learning. This adds great support to the 
claims that cortico-basal ganglia circuits play a role in abstract cognitive 
processes. We observed that DS neurons strongly modulated their
activity in relation to M1 activity, even when the latter was dissociated 
from physical movements, suggesting that the striatum is important for 
learning and selecting abstract actions that are controlled by cortical 
output. Hence, these data suggest that cortico-basal ganglia circuits 
may be involved in learning mental actions and skills that do not 
require physical movement, indicating that they may have a broader 
function involved in intention and decision-making than previously 
acknowledged. 

Our results also have important implications for the field of brain-machine
interfaces. The abstract actions investigated here form the
basis for skilful neuroprosthetic control and, as we have shown here,
they recruit elements of the natural motor system outside of M1. Thus, 
our results suggest that neuroprosthetic movements capitalize on the 
neural circuitry for motor learning and therefore have great potential 
to feel naturalistic, generalize well to novel movements and environ- 
ments, and benefit from our nervous system’s highly developed storage 
and retrieval mechanisms for skilled behaviour. 


METHODS SUMMARY 


All experiments were done in accordance with the Animal Care and Use Committee 
at the University of California, Berkeley, and at the National Institute on Alcohol 
Abuse and Alcoholism, and according to National Institutes of Health guidelines. 
Six male Long-Evans rats, seven RGS9L-Cre/Nr1f/f mice and eight littermate
controls were chronically implanted with tungsten microelectrode arrays in both 
M1 and the DS ipsilaterally. Two ensembles of two to four well-isolated M1 units 
each were chosen and, by modulating activity in these ensembles, rodents 
controlled the pitch of an auditory cursor, with increased activity in the first 
ensemble producing increases in the cursor pitch and increased activity in the 
other ensemble producing decreases in the cursor pitch. The rodents had to 
modulate these ensembles precisely to move the cursor to one of two target pitches 
to get reward (one associated with 20% sucrose and another with pellets). Rodents 
were free to choose either reward at any time. A trial was correct if a target was 
achieved within 30 s and incorrect trials were followed by a time-out. M1 activity 
levels had to return to baseline levels for a new trial to begin. After performance had 
reached plateau levels, the action-outcome contingency was degraded by providing
outcomes on a variable time schedule to match the probabilities between
getting a reward after target achievement and no target achievement. For outcome 
devaluation, rodents were given a sensory-specific satiety test where they received 
free access to one of the rewards (but not the other) for 1 h before a behavioural
session. For the omission test, rodents stopped being rewarded for reaching one of 
the targets and the reward associated with that target was instead supplied when 
rodents successfully inhibited responses for the duration of the trial. See 
Supplementary Information for further details. 
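
The relation between the two M1 ensembles and the auditory cursor can be sketched as below. The source states only the direction of each ensemble's effect, the existence of two target pitches and the 30 s trial limit. The linear mapping, the baseline pitch, the gain, the target frequencies and the bin width are all hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CursorConfig:
    # All values below are hypothetical placeholders, not taken from the paper.
    baseline_hz: float = 8000.0       # pitch when both ensembles are equally active
    gain_hz_per_spike: float = 500.0  # pitch change per unit rate difference
    low_target_hz: float = 2000.0     # target reached by raising ensemble 2
    high_target_hz: float = 14000.0   # target reached by raising ensemble 1


def cursor_pitch(e1_rate, e2_rate, cfg):
    """Ensemble 1 raises the pitch, ensemble 2 lowers it (assumed linear)."""
    raw = cfg.baseline_hz + cfg.gain_hz_per_spike * (e1_rate - e2_rate)
    return max(cfg.low_target_hz, min(cfg.high_target_hz, raw))


def run_trial(rate_pairs, cfg, bin_s=0.2, max_s=30.0):
    """Return 'high' or 'low' if a target is reached within max_s, else 'timeout'.

    rate_pairs is an iterable of (ensemble 1 rate, ensemble 2 rate) per time bin.
    """
    for i, (e1, e2) in enumerate(rate_pairs):
        elapsed = (i + 1) * bin_s
        if elapsed > max_s:
            break
        pitch = cursor_pitch(e1, e2, cfg)
        if pitch >= cfg.high_target_hz:
            return "high"
        if pitch <= cfg.low_target_hz:
            return "low"
    return "timeout"
```

Because the pitch depends on the difference between the two ensembles, a global increase in firing does not move the cursor. Under this assumed mapping, only differential modulation of the two ensembles reaches a target.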
Cells are organized on length scales ranging from ångström to
micrometres. However, the mechanisms by which ångström-scale
molecular properties are translated to micrometre-scale macroscopic 
properties are not well understood. Here we show that interactions 
between diverse synthetic, multivalent macromolecules (including 
multi-domain proteins and RNA) produce sharp liquid-liquid- 
demixing phase separations, generating micrometre-sized liquid 
droplets in aqueous solution. This macroscopic transition corre- 
sponds to a molecular transition between small complexes and large, 
dynamic supramolecular polymers. The concentrations needed for 
phase transition are directly related to the valency of the interacting 
species. In the case of the actin-regulatory protein called neural 
Wiskott-Aldrich syndrome protein (N-WASP) interacting with its 
established biological partners NCK and phosphorylated nephrin,
the phase transition corresponds to a sharp increase in activity 
towards an actin nucleation factor, the Arp2/3 complex. The trans- 
ition is governed by the degree of phosphorylation of nephrin, 
explaining how this property of the system can be controlled to 
regulatory effect by kinases. The widespread occurrence of multi- 
valent systems suggests that phase transitions may be used to spa- 
tially organize and biochemically regulate information throughout 
biology. 

Covalent and non-covalent interactions between multivalent small 
molecules are central elements of classical polymer chemistry and 
physics and of supramolecular chemistry. These fields have produced
theories and experimental demonstrations of sharp transitions between 
small assemblies and macroscopic polymer gels (known as sol-gel 
transitions) as the degree of bonding increases. The transition point 
(critical point) depends on the physical properties of the monomeric 
species, including valency and affinity. The polymer can have a variety of
physical forms, ranging from phase-separated liquid to crystalline solid. 
For non-covalent systems, phase separation can strongly influence the 
sol-gel transition by altering the degree of bonding. In biology, interactions
between multivalent entities are found in diverse processes,
including extracellular carbohydrate-lectin binding, intracellular signalling,
RNA metabolism and chromatin organization in the nucleus.
Biological multivalency has been studied most extensively in the 
context of extracellular ligands binding to cell surface receptors, where 
antibody-receptor and carbohydrate-lectin systems can assemble
into crosslinked networks. These networks are typically precipitates,
but liquid-like gels have also been described. Multivalency has been less
studied in the context of intracellular molecules, which often share the 
characteristics of high valency, modest affinity, and long, flexible connections
between binding elements. Here we asked whether these
systems also undergo sharp transitions to polymers, and if so, what 
the macroscopic properties of the polymers are and how such transitions 
could be regulated and affect function. 


Initially, we examined interactions between the SRC homology 3 
(SH3) domain and its proline-rich motif (PRM) ligand, two widely 
observed modules that often appear in tandem arrays in signalling 
proteins. We generated two classes of engineered proteins: one
composed of repeats of a single SH3 domain (SH3ₘ, where
m = 1-5), and the other composed of repeats of a PRM ligand
(PRMₙ, where n = 1-5) (dissociation constant (Kd) = 350 µM for
the SH3₁-PRM₁ interaction; Supplementary Fig. 1). Initially, we mixed
SH3₄ with PRM₄. At low concentrations, the solutions were clear; by
contrast, at high concentrations, they were opalescent. Examination of
these opalescent solutions using light microscopy showed the presence
of numerous spherical droplets of approximately 1 µm to >50 µm in
diameter that had phase-separated from the bulk solution (Fig. 1a and
Supplementary Fig. 2). Smaller droplets tended to coalesce into larger 
droplets over time, which is consistent with liquid-like properties 
(Fig. 1b). When the proteins were mixed in a 1:1 ratio, both the droplet
and bulk phases contained equal amounts of each molecule, but the 
proteins were concentrated by about 100-fold in the droplets relative to 


Figure 1 | Macroscopic and microscopic phase transitions in multivalent 
SH3-PRM systems. a, Liquid droplets observed by differential interference 
contrast microscopy (left) and wide-field fluorescence microscopy (right) when 
300 µM SH3₄, 300 µM PRM₄ (both of which are module concentrations;
molecule concentrations are 75 µM) and 0.5 µM OG-SH3₄ were mixed. Scale
bars, 20 µm. b, Time-lapse imaging of merging droplets that were formed as in
a. Scale bar, 10 µm.
the bulk phase (Fig. 1a, right; 116-fold for SH3₅ plus PRM₅; and 82-fold
for SH3₅ plus an octameric dendrimer, PRM(N-WASP)₈).
Analogous droplets were also observed for an unrelated SH3₅-ligand₅
pair and for the tetravalent RNA binding protein PTB interacting with 
an RNA oligonucleotide (Supplementary Fig. 3). Thus, liquid-liquid- 
demixing phase transitions may occur in many multivalent intra- 
cellular systems. 

A large body of data indicates that the phase separation observed 
here is driven by the assembly of the multivalent proteins into large 
species, analogous to the behaviour observed in many small-molecule 
polymer systems as well as in covalent protein crosslinking. First,
the phase boundary is strongly dependent on the valency of the inter- 
acting species (Fig. 2a and Supplementary Fig. 4). This observation is 
consistent with theory and our simulations, which indicate that a higher 
valency allows the formation of larger species at a lower fractional 
saturation of the binding modules (Supplementary Information
and Supplementary Figs 5-7). 
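
The dependence of the transition on valency can be illustrated with the classical Flory-Stockmayer gel-point estimate. This is an idealized mean-field sketch, assuming equal reactivity of all modules, no intramolecular (cyclic) bonds and equal total concentrations of the two module types. It is not the simulation described in the Supplementary Information, and the function name is hypothetical.

```python
import math


def critical_bond_fraction(valency_a, valency_b):
    """Fraction of modules that must be bound for an infinite network to form.

    For two species with valencies f and g that bind only each other, the
    mean-field gel condition is p_a * p_b * (f - 1) * (g - 1) = 1. With equal
    module concentrations p_a = p_b = p, so p_c = 1 / sqrt((f - 1) * (g - 1)).
    Returns None when either species is monovalent, because no network can form.
    """
    if valency_a < 2 or valency_b < 2:
        return None
    return 1.0 / math.sqrt((valency_a - 1) * (valency_b - 1))
```

Under these assumptions, a pentavalent pair needs only a quarter of its modules bound, a tetravalent pair a third, and a divalent pair all of them, which matches the qualitative statement that higher valency permits large species at lower fractional saturation.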

Second, the phase transition can be blocked by a high-affinity 
monovalent ligand, PRM(H)₁ (Kd = 10 µM for SH3₁ binding to
PRM(H)₁; Supplementary Fig. 8). Third, we used dynamic light scattering
(DLS) and small-angle X-ray scattering (SAXS) to characterize
the species that formed during titrations of PRM proteins into SH3₅
(Fig. 2b and Supplementary Figs 10-12). At concentrations below 
those needed for droplet formation, the addition of either PRM, or 
PRM, substantially increased the apparent hydrodynamic radius (Rh,
for DLS) and the apparent radius of gyration (Rg, for SAXS), suggesting
that oligomeric species formed. For equal total module concentrations, 
PRM, caused larger increases in these values than did PRM), consist- 
ent with predictions from polymer theory***"*. For higher concentra- 
tions of PRM proteins, we removed the droplets by centrifugation and 
found that increasing the concentration of PRM, or PRM, caused Rh
and Rg to decrease in the bulk phase, suggesting that larger species
partitioned selectively into the droplet phase. This behaviour resembles 
that observed in the sol-gel transitions of covalent polymers, in which, 
above the critical extent of reaction, the average size of oligomers in the 
Figure 2 | Multivalency drives phase separation and probably drives a sol-gel
transition in the droplet phase. a, Phase diagrams of multivalent SH3 and
PRM proteins. The concentrations are in terms of the modules. The red circles 
indicate phase separation, and the blue circles indicate no phase separation. 
b, The Rg values determined from SAXS data that were collected during
titrations of PRM proteins into SH3₅. Closed circles indicate the absence of
phase separation; open circles indicate data collected on the supernatant phase, 
which was separated from the droplets by centrifugation. The titrations used 
PRM, (orange), PRM, (blue), PRM, (green) and PRM(H)), (red). The error 
bars represent the s.d. calculated from five to ten independent measurements of 
intensity versus scattering angle (q). c, The intensity autocorrelation curve of 
light scattered at 90° from the pooled droplet phase of SH3₅ plus PRM(N-WASP)₈.
τ, the relaxation time constant of the most rapidly decaying phase.
d, Cryo-electron microscopy image of a droplet formed by SH3₅ plus PRM₅
(identical image, left and right). The edge of the structure is outlined in red 
(right). Scale bars, 100 nm. 
sol phase decreases because larger species preferentially join the gel.
Titrations of PRM, or PRM(H),, which unlike PRM, and PRM, 
cannot generate polymers and did not cause phase separation, produced
only small, saturable increases in Rh and Rg. Taken together,
these data indicate that phase separation is driven by the unique ability 
of multivalent SH3ₘ-PRMₙ interactions to create large assemblies.

Additional lines of evidence suggest that the multivalent proteins 
formed large polymers within the droplets, such that the phase trans- 
ition probably coincides with a sol-gel transition. First, at the extremely 
high concentrations in the droplet phase, the fractional saturation of 
SH3 and PRM binding sites is estimated to be threefold to fivefold 
higher than that needed to induce the sol-gel transition (Supplementary
Materials, Cyclization).

Second, DLS analyses of the droplet phases created by mixing SH3ₘ
and PRMₙ showed multiphase intensity autocorrelation curves with a
complex distribution of relaxation times spanning 0.2-20 ms (Supplementary
Fig. 13) or longer (Fig. 2c). Some of these relaxation times
scale nonlinearly with the square of the scattering angle (q²) (Supplementary
Fig. 13). The wide range of timescales, the presence of
long-timescale processes and the q² independence of some of these
processes are properties that are typical of polymer solutions in the
semi-dilute range but are highly atypical of discrete molecular species.
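
The q² argument can be made concrete with a small sketch. For a discrete species undergoing simple diffusion, the decay rate of the autocorrelation function is D q², where q = (4πn/λ) sin(θ/2), so the apparent diffusion coefficient 1/(τ q²) is the same at every angle. The wavelength, refractive index, tolerance and function names below are hypothetical and are not taken from the paper.

```python
import math


def scattering_vector(theta_deg, wavelength_nm=633.0, refractive_index=1.33):
    """Magnitude of the scattering vector q in nm^-1 (assumed laser and solvent)."""
    half_angle = math.radians(theta_deg) / 2.0
    return 4.0 * math.pi * refractive_index / wavelength_nm * math.sin(half_angle)


def apparent_diffusion(tau_s, theta_deg):
    """Apparent diffusion coefficient (nm^2/s) implied by relaxation time tau."""
    q = scattering_vector(theta_deg)
    return 1.0 / (tau_s * q * q)


def is_diffusive(taus_s, thetas_deg, rel_tol=0.1):
    """True if 1 / (tau * q^2) is constant across angles within rel_tol."""
    ds = [apparent_diffusion(t, th) for t, th in zip(taus_s, thetas_deg)]
    mean = sum(ds) / len(ds)
    return all(abs(d - mean) <= rel_tol * mean for d in ds)
```

A relaxation time that stays the same at all angles fails this check, which is the kind of q² independence that the text attributes to semi-dilute polymer solutions rather than to discrete molecular species.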

Third, photobleaching experiments showed that the diffusion of the 
droplet constituents was slowed by about three orders of magnitude 
relative to free diffusion in water (data not shown). Furthermore, the 
photobleaching recovery rate correlated inversely with the monomer- 
monomer affinity and valency (Supplementary Fig. 14), suggesting 
that recovery represents reorganization of a polymer matrix. Small- 
molecule fluorophores and enhanced green fluorescent protein 
(eGFP) can enter, and move rapidly within, the droplets (Supplemen- 
tary Fig. 15). 

Last, cryo-electron microscopy images of SH3₅ plus PRM₅ solutions
that were flash-frozen immediately after mixing showed numerous 
round, electron-dense objects with diameters of about 50-500 nm 
(Fig. 2d and Supplementary Fig. 16). These objects were not observed 
with high concentrations of either SH3₅ or PRM₅ alone or when the
two components were mixed at concentrations below the droplet con- 
centration. Although the objects were too dense for structures to be 
observed within them, their edges were highly irregular on the ~10-nm 
scale, suggesting that they contained large but disordered molecular 
species, consistent with a crosslinked gel. These characterizations of 
the droplet phase suggest that this phase contains large polymeric 
species and has undergone a sol-gel transition. 

Taken together, our data suggest a model in which the association of 
multivalent proteins produces a macroscopic liquid-liquid phase sepa- 
ration, which is thermodynamically coupled to a molecular sol-gel 
transition within the droplet phase. The sharp transition between small 
complexes and polymers is consistent with condensation polymerization
theory and our own particle-based simulations (Supplementary
Fig. 5). A theoretical analysis of these systems (Supplementary
Information) indicates that the polymerization process is driven 
appreciably by the extremely high configurational entropy of the 
polymer.

The behaviour that we observed for these multivalent systems in 
vitro was mirrored in cells. The coexpression of mCherry-SH3₅ and
eGFP-PRM₅ fusion proteins in HeLa cells resulted in the formation of
cytoplasmic puncta approximately 0.5-2 µm in diameter containing
both fluorophores (Fig. 3a). Puncta were not observed in cells expressing
either protein alone or in cells coexpressing mCherry-SH3₅ and
eGFP-PRM₃, indicating that their formation depends on the interaction
between the two high-valency molecules. The puncta did not
stain with a large range of vesicle markers or a lipid dye, suggesting that 
they are phase-separated bodies rather than vesicular structures 
(Supplementary Fig. 18). Both mCherry and eGFP fluorescence of 
the bodies recovered within about 10s of photobleaching (Fig. 3b), 
indicating that there was rapid exchange of both components with the 
Figure 3 | Coexpression of SH3₅ and PRM₅ in cells produces dynamic
puncta. a, mCherry-SH3₅ (left), eGFP-PRM₅ (centre) and an image overlay in
a cell expressing both proteins (right). It should be noted that the non-uniform 
eGFP fluorescence in the puncta results from mCherry-eGFP fluorescence 
resonance energy transfer (FRET) rather than from differential localization of 
the proteins (Supplementary Fig. 17). Scale bar, 2 µm. b, Both mCherry and
eGFP fluorescence recover rapidly after photobleaching. 


surrounding cytoplasm and suggesting that the bodies had a dynamic 
liquid-like nature. Thus, interactions between multivalent proteins can 
produce phase-separated liquid droplets both in vitro and in cells. 

The nephrin-NCK-N-WASP system is a natural, three-component 
interaction that can be used to investigate phase transitions that result 
from multivalent interactions, as well as the functional consequences 
of these transitions (Fig. 4a). In kidney podocytes, the transmembrane 
protein nephrin plays a central role in forming the glomerular filtration
barrier, functioning partly through assembling cortical actin. The
cytoplasmic tail of nephrin contains three tyrosine phosphorylation
(pTyr) sites, which can each bind the SH2 domain of NCK. NCK
contains three SH3 domains, which can bind the six PRMs in the
proline-rich region of N-WASP. N-WASP, in turn, stimulates the
nucleation of actin filaments by the Arp2/3 complex. The multivalency
of nephrin or NCK is necessary for proper actin assembly and,
together with the multivalency of N-WASP, has the potential to cause
phase transitions. 

The addition of NCK to an N-WASP construct (GBD-P-VCA, 
Supplementary Table 1; called N-WASP hereafter) caused droplet 
formation, as occurred in the model systems described above 
(Fig. 4b). The addition of a diphosphorylated (2pTyr) nephrin tail 
peptide lowered the phase boundary for both proteins by at least 
twofold (Fig. 4c), presumably because the effective valency of 
NCK increases when it is arrayed on nephrin. This effect was even more 
pronounced when nephrin-3pTyr peptide was added (to the same total 
pTyr concentration) (Fig. 4d). Thus, in cells, protein kinases could 
regulate phase transitions in this system (and the cooperative assembly 
of all three proteins) by controlling the degree of phosphorylation of 
nephrin and, consequently, by shifting the phase boundary from the 
micromolar (Fig. 4b) to the nanomolar (Fig. 4d) regime. It should be 
noted that, where measured, the cytoplasmic concentrations of WASP 
proteins and other actin-regulatory molecules were typically 1-10 µM, 
indicating that shifts in this range could be functionally important 
(Supplementary Fig. 19). 

Figure 4 | Phase transition correlates with biochemical activity transition in 
the nephrin-NCK-N-WASP system. a, The interactions of nephrin, NCK 
and N-WASP. b-d, Phase diagrams of N-WASP and NCK alone (b, inset 
shows higher concentrations) or in the presence of 4.5 µM diphosphorylated 
nephrin tail peptides (c) or 3 µM triphosphorylated nephrin tail peptides 
(d). The red circles indicate phase separation, and the blue circles indicate no 
phase separation. e, The half-time to completion (t₁/₂) of N-WASP-stimulated 
actin assembly by the Arp2/3 complex as a function of N-WASPΔ 
concentration. The vertical dashed line indicates the phase-separation 
boundary determined in separate assays without actin and the Arp2/3 complex. 
f, Rhodamine-actin (4 µM, 10% rhodamine-labelled), 300 nM Alexa-488- 
phalloidin and 10 nM Arp2/3 complex were added to droplets containing 
triphosphorylated nephrin, NCK and N-WASP, and the droplets were imaged 
by confocal microscopy; rhodamine (top) and Alexa 488 (bottom) are shown 
separately. Scale bars, 10 µm. 

We next asked how droplet formation affects the ability of the 
nephrin-NCK-N-WASP system to stimulate Arp2/3-mediated actin 
assembly. We measured the half-time to completion (t₁/₂) of pyrene- 
actin assembly reactions containing fixed concentrations of the Arp2/3 
complex, NCK, nephrin-3pTyr peptide and N-WASP plus variable 
amounts of an N-WASP truncation mutant (N-WASPΔ) that contains 
the full proline-rich region and can assemble into polymers but cannot 
bind the Arp2/3 complex. In the absence of N-WASPΔ, 50 nM 
N-WASP produced only weak stimulation of the Arp2/3 complex 
(long t₁/₂) (Fig. 4e). The addition of N-WASPΔ to concentrations 
between 0 and 750 nM had no effect on actin assembly. However, 
1,000 nM N-WASPΔ sharply increased the activation of the Arp2/3 
complex. The activity increased asymptotically as the concentration of 
N-WASPΔ was raised further. N-WASPΔ had no effect on an 
N-WASP protein that lacks the proline-rich region, suggesting that 
engagement by NCK is needed for the increase in activity (Supplementary 
Fig. 20). 
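
As an illustration of how a half-time to completion can be read from a pyrene-actin fluorescence trace, the following Python sketch interpolates the time at which the signal reaches the midpoint between baseline and plateau. The function name, the choice of the first and last samples as baseline and plateau, and the synthetic trace are assumptions made for illustration; they are not the analysis procedure used in this study.

```python
def half_time(times, signal):
    """Return the time at which the signal first reaches half of its total rise.

    Assumes (for illustration) that signal[0] is the baseline and signal[-1]
    is the plateau of a completed assembly reaction.
    """
    baseline, plateau = signal[0], signal[-1]
    midpoint = baseline + 0.5 * (plateau - baseline)
    for i in range(1, len(signal)):
        lo, hi = signal[i - 1], signal[i]
        if lo < midpoint <= hi:
            # Linear interpolation between the two bracketing samples.
            frac = (midpoint - lo) / (hi - lo)
            return times[i - 1] + frac * (times[i] - times[i - 1])
    raise ValueError("signal never crosses its midpoint")


# Synthetic trace (hypothetical data): fluorescence rising from 0 to 100.
times = [0, 100, 200, 300, 400]          # seconds
signal = [0.0, 20.0, 60.0, 90.0, 100.0]  # arbitrary units
t_half = half_time(times, signal)
```

A shorter half-time corresponds to stronger stimulation of the Arp2/3 complex, so a sharp drop in this value as N-WASPΔ is added would mark the activity transition described above.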

These data are consistent with the switch-like formation of a higher 
activity” (probably polymeric) form of nephrin-NCK-N-WASP 
when the total concentration of wild-type (active) plus truncated 
(inactive) N-WASP surpasses that needed for droplet formation. 
The sharp transition was followed by slower increases in activity as 
additional N-WASPΔ drew a greater percentage of the wild-type protein 
into the droplets/polymer. Consistent with these ideas, when mixtures 
containing N-WASP above its phase-separation concentration were 
stained with phalloidin, numerous bundles of actin filaments were 
observed within the droplets (Fig. 4f). In cells, nephrin is a transmem- 
brane protein, and the system would not produce a three-dimensional 
polymer phase as observed here but rather its two-dimensional equi- 
valent at the plasma membrane. Such interactions might contribute to 
the formation and/or the stability of the micrometre-scale clusters of 
nephrin and NCK with the associated actin tails, which are observed in 
cells when nephrin is crosslinked (and subsequently phosphorylated)’. 
Of the 28 known binding partners of the NCK SH2 domain, 14 are 
predicted or have been demonstrated to contain three or more pTyr 
sites***, suggesting that analogous pTyr-NCK-N-WASP assembly 
may occur in many systems. 

Sharp phase transitions occurring concomitantly with sol-gel transi- 
tions may be a general feature of multivalent systems in biology. Many 
‘cellular bodies’—that is, subcellular compartments that are composi- 
tionally distinct from the surrounding cytoplasm or nucleoplasm but 
that are not membrane bounded**”*—are enriched in multivalent 
proteins and nucleic acids””’. These include promyelocytic leukaemia 
nuclear bodies, Cajal bodies, P bodies and P granules”. Moreover, P 
granules in the Caenorhabditis elegans embryo were recently shown to 
have liquid-like properties with many of the features that we observed 
here, including switch-like formation and greatly slowed diffusion of 
constituent molecules*®*. Many multivalent proteins, in general, can 
organize into puncta in the cytosol or at membranes (Supplementary 
Fig. 21). Within puncta, the physical properties of a polymer could 
impart micrometre-scale structural and dynamic organization and 
control the chemistry (for example, of catalysis, molecular interactions 
or structural rearrangements). Our findings provide a mechanism by 
which multivalent interactions could yield sharp transitions between 
physically and functionally distinct states, generating nonlinearity in 
signalling pathways, connecting disparate length scales in the cell, and 
perhaps contributing to the structure and function of cellular bodies 
and other two- and three-dimensional compartments. 


METHODS SUMMARY 

Material generation. Details are provided in the Supplementary Information. 
In vitro phase separation. Samples were incubated for >12 h before scoring them 
for droplets using bright-field microscopy; when observed, the droplets formed 
immediately after mixing. For in vitro fluorescence recovery after photobleaching 
(FRAP) experiments, a 5-µm diameter spot was bleached in droplets of >20-µm 
diameter containing Oregon Green (OG)-labelled SH3, (OG-SH3,) by using a 
488-nm laser line. The mean intensity of the bleached spot was fit to a single 
exponential. 

DLS and SAXS. For titrations monitored by DLS (DynaPro, Wyatt) and SAXS 
(Advanced Photon Source, Biophysics Collaborative Access Team (BioCAT) 
beamline), the samples contained 170 µM SH3₅ plus PRM proteins at PRM:SH3 
module ratios of 0-5. Droplets were removed from all of the relevant samples by 
centrifugation (at 16,000g for 10 min) before analysis. For SH3₅ plus PRM(N-WASP)₈ 
and SH3₅ plus PRM₅ droplets, single-angle DLS data were collected on a 
DynaPro instrument, and multi-angle DLS data were collected on a custom-built 
apparatus. 

Cryo-electron microscopy. The samples were blotted, frozen immediately after 
mixing and imaged under low-dose cryo conditions in a 2200FS FEG transmission 
electron microscope (JEOL), using a 2k × 2k slow-scan charge-coupled device 
(CCD) camera (Tietz). 

Cellular assays. HeLa cells were imaged at 32°C 24h after transfection with 
vectors expressing mCherry-SH3₅ and/or eGFP-PRM₅ (or eGFP-PRMs). 
Images were acquired on an LSM 510 confocal microscope (Zeiss). For FRAP 
experiments, individual puncta containing both mCherry-SH3₅ and eGFP-PRM₅ 
were bleached with a 488-nm laser line. 

Actin polymerization. Actin (4 µM, 5% pyrene-labelled actin) was polymerized 
in the presence of 3 µM nephrin-3pTyr, 10 µM NCK, 50 nM N-WASP and 10 nM 
Arp2/3 complex plus increasing concentrations of N-WASPΔ in 150KMEI buffer 
(150 mM KCl, 1 mM MgCl₂, 1 mM EGTA and 10 mM imidazole, pH 7) as 
described previously’. Imaging was performed on an LSM 510 confocal microscope 
after adding rhodamine-actin (4 µM, 10% rhodamine-labelled actin), 
300 nM Alexa 488-phalloidin and 10 nM Arp2/3 complex to droplets containing 
3 µM nephrin-3pTyr, 10 µM NCK and 2 µM N-WASP. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Measurements of protein stoichiometry and concentration in droplets. All 
phase-separation assays were performed in 150KMEI buffer (150 mM KCl, 
1 mM MgCl₂, 1 mM EGTA and 10 mM imidazole, pH 7), except where indicated. 
SH3, and PRM, (doped with 0.5% of either protein labelled with Oregon Green) 
were incubated at a 1:1 stoichiometry at various concentrations at room temper- 
ature overnight. The droplet phase (where formed) was collected by centrifugation, 
and the supernatant was removed and digested with proteinase K (Promega) 
overnight. The fluorescence intensity (excitation wavelength (λex) = 488 nm, 
and emission wavelength (λem) = 512-521 nm) of the samples was converted to 
concentration using a standard curve of Oregon Green alone. The data revealed a 
1:1 stoichiometry in all cases in both the droplet and the supernatant phases. In 
other systems that were mixed at a 1:1 module ratio, we calculated the concentra- 
tions from the absorbance at 280nm using module-weighted extinction coeffi- 
cients, assuming a 1:1 module stoichiometry in all phases. 
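
The standard-curve conversion described above can be sketched as a linear fit followed by a correction for the labelled fraction. In this Python sketch the function names, the calibration values and the assumption that each labelled protein carries exactly one Oregon Green dye are illustrative only and are not taken from the paper.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def dye_to_protein_conc(fluorescence, slope, intercept, labelled_fraction):
    """Convert a fluorescence reading to total protein concentration.

    The reading is first mapped to a dye concentration with the standard
    curve, then divided by the fraction of protein that carries the dye.
    """
    dye_conc = (fluorescence - intercept) / slope
    return dye_conc / labelled_fraction


# Hypothetical standard curve: free dye concentration (uM) vs fluorescence (a.u.).
std_conc = [0.0, 1.0, 2.0, 4.0]
std_fluor = [10.0, 110.0, 210.0, 410.0]
slope, intercept = fit_line(std_conc, std_fluor)

# A sample doped with 0.5% labelled protein (labelled_fraction = 0.005).
protein_uM = dye_to_protein_conc(260.0, slope, intercept, labelled_fraction=0.005)
```

Applying the same conversion to the droplet and supernatant fractions for each labelled protein in turn would give the concentration ratio from which a stoichiometry can be read.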

SAXS. SAXS experiments were performed at the Biophysics Collaborative Access 
Team (BioCAT) undulator beamline 18-ID at the Advanced Photon Source, 
Argonne National Laboratory as described previously*'. All samples were centri- 
fuged for 10 min at 16,000g at 4°C before mixing. In mixtures above the phase- 
transition concentration, solutions were centrifuged for an additional 10 min to 
remove the droplet phase before SAXS analysis. To minimize radiation damage, 
all samples were in 150KMEI buffer containing 5 mM TCEP and were flowed at 
5 µl s⁻¹ during data collection*’. Scattering intensity profiles over the q range from 
0.006 to 0.36 Å⁻¹ were calculated from radial averaging of the two-dimensional 
scattering patterns using macros written by the BioCAT staff for the software 
IGOR Pro (WaveMetrics). The scattering for each sample and the accompanying 
buffer were averaged from five to ten repeats, and the protein scattering was obtained 
by subtracting the buffer scattering. Guinier plots were used to calculate Rg and to 
ascertain the absence of aggregates. Alternatively, Rg and the distance distribution 
function, P(r), were calculated using the program GNOM™. The apparent molecular 
weight was estimated*’ by integrating the Kratky plot to a maximum q at 0.2 Å⁻¹. 
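
For readers unfamiliar with the Guinier analysis mentioned above, a minimal Python sketch follows. It fits a straight line to ln I(q) against q², using the Guinier approximation ln I(q) = ln I(0) - (Rg²/3)q², and recovers Rg from the slope. The function name and the synthetic data are hypothetical, and the sketch omits the restriction to the low-q region (commonly q·Rg below about 1.3) that a real analysis would enforce.

```python
import math


def guinier_rg(q_values, intensities):
    """Estimate Rg and I(0) from ln I(q) = ln I(0) - (Rg**2 / 3) * q**2."""
    xs = [q * q for q in q_values]
    ys = [math.log(i) for i in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                  # equals -Rg**2 / 3
    intercept = mean_y - slope * mean_x
    return math.sqrt(-3.0 * slope), math.exp(intercept)


# Synthetic low-q data for a particle with Rg = 20 angstroms (hypothetical).
true_rg = 20.0
q_values = [0.01, 0.02, 0.03, 0.04, 0.05]  # inverse angstroms
intensities = [100.0 * math.exp(-(true_rg * q) ** 2 / 3.0) for q in q_values]
rg, i0 = guinier_rg(q_values, intensities)
```

Upward curvature of the Guinier plot at the lowest q values is the usual sign of aggregation, which is why the same plot serves both purposes named in the text.
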
DLS titration. In the titration experiments, scattering measurements were per- 
formed on a DynaPro instrument (Wyatt) at 30% laser power and 22 °C. Twenty 
repetitive data sets, each of 100s duration, were averaged for each titration point. 
Autocorrelation data were analysed using the regularization method in Dynamics 
version 6.4.3 software (Proterion Corporation). All samples were centrifuged 
before and after mixing, as for the SAXS titration. 

DLS of SH3₅ plus PRM(N-WASP)₈ droplets. To collect enough volume of the 
droplet phase (~25 µl), 850 µM SH3₅ was mixed with 850 µM PRM(N-WASP)₈ 
(module concentrations) to a final volume of 900 µl in 150KMEI buffer. The 
denser, droplet, phase was collected by centrifugation (16,000g for 10 min) and 
analysed using a DynaPro instrument at 22°C and 20% laser power. Twenty 
repetitive data sets, each of 1,000 s duration, were averaged. 

Multi-angle DLS of SH3₅ plus PRM₅ droplet and supernatant phases. SH3₅ 
(850 µM) was mixed with PRM₅ (850 µM) (module concentrations) to a final 
volume of 900 µl to collect enough volume of the droplet phase (~20 µl). The 
denser, droplet, phase was collected by centrifugation. Measurements were made 
using a custom-built apparatus equipped with an Innova 90 argon laser (Coherent) 
set to 514.5 nm. A wide-range photometer-preamplifier-discriminator (Pacific 
Precision Instruments) drove a pulse shaper (ALV), which fed an ALV-5000 digital 
autocorrelator (ALV). The temperature was maintained at 25 °C by a circulating 
water bath. For the supernatant phase, three 10-min runs were collected and 
averaged at each scattering angle between 30° and 110°. For the droplet phase, a 
single 30-min run was collected at each angle. The distributions of the decay rate 
and the average decay rate were determined by analysing the correlation functions 
using an inverse Laplace transformation in the program CONTIN™. 
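
To show how decay rates measured at several angles relate to particle size, the following Python sketch assumes purely diffusive behaviour, for which the decay rate equals D·q², and converts the fitted diffusion coefficient to a hydrodynamic radius with the Stokes-Einstein relation. The refractive index (1.33) and viscosity (0.89 mPa s) are those of water at 25 °C and are assumptions, as are the function names and the synthetic decay rates; the dense droplet phase would not be expected to match these solvent values.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def scattering_vector(angle_deg, wavelength_m, refractive_index):
    """Magnitude of the scattering vector q (1/m) at a given scattering angle."""
    half_angle = math.radians(angle_deg) / 2.0
    return 4.0 * math.pi * refractive_index / wavelength_m * math.sin(half_angle)


def diffusion_coefficient(angles_deg, decay_rates, wavelength_m, refractive_index):
    """Least-squares slope through the origin of decay rate versus q**2."""
    q2 = [scattering_vector(a, wavelength_m, refractive_index) ** 2 for a in angles_deg]
    return sum(g * x for g, x in zip(decay_rates, q2)) / sum(x * x for x in q2)


def hydrodynamic_radius(d, temperature_k, viscosity_pa_s):
    """Stokes-Einstein radius (m) for a diffusion coefficient d (m^2/s)."""
    return K_B * temperature_k / (6.0 * math.pi * viscosity_pa_s * d)


# Synthetic decay rates for a purely diffusive species (hypothetical values).
wavelength, n_medium = 514.5e-9, 1.33
true_d = 5.0e-11  # m^2/s
angles = [30, 50, 70, 90, 110]
rates = [true_d * scattering_vector(a, wavelength, n_medium) ** 2 for a in angles]

d_fit = diffusion_coefficient(angles, rates, wavelength, n_medium)
r_h = hydrodynamic_radius(d_fit, 298.15, 0.89e-3)  # roughly 4.9 nm here
```

A decay rate that does not scale linearly with q² across the angles would indicate that the relaxation is not simple translational diffusion.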

Cryo-electron microscopy. Carbon-coated copper grids were glow-discharged in 
an EMS-100 unit with a 40 mA current for 90 s. SH3₅ (100 µM) was loaded onto 
the grid, followed by PRM₅ (100 µM) (module concentrations), giving a 4 µl final 
volume. The mixture was immediately blotted for 6.5 s in a Vitrobot Mark II 
(Gatan) before being plunged into liquid ethane. Mixtures of SH3₅ and PRM₅ 
below the critical concentrations (2.5 µM module concentration each), or SH3₅ or 
PRM₅ alone (250 µM module concentration) were prepared identically. Samples 
contained 150KMEI buffer. Frozen grids were stored in liquid nitrogen until 
electron microscopy analysis. Imaging was performed under low-dose cryo conditions 
in a 2200FS FEG transmission electron microscope (JEOL), using a 2k × 2k 
slow-scan charge-coupled device (CCD) camera (Tietz). The dose rate was 
adjusted to ~20 electrons Å⁻² s⁻¹; the defocus level varied from -1.0 to -3.0 µm. 

Cell culture and fluorescence microscopy. HeLa cells were cultured in Dulbecco’s 
modified Eagle’s medium (DMEM) containing 10% FBS. Cells were transfected 
using Effectene (QIAGEN) and incubated for 24h at 37 °C. For Nile-Red staining, 
cells were incubated with Nile Red (at a final concentration of 0.5 µg ml⁻¹) for 15, 30 
or 60 min before imaging. For immunofluorescence analysis, transfected cells were 
fixed with 4% paraformaldehyde in PBS and permeabilized with 0.1% Triton X-100 
in PBS for 5 min. The antibodies used for staining were anti-EEA1, anti-GM130, 
anti-golgin-97, anti-VPS35, anti-CIMPR, anti-clathrin heavy chain, anti-LAMP1, 
anti-SNX1, anti-SNX2, anti-TGN46 (provided by D. Billadeau and T. Gomez) and 
anti-caveolin-1 antibodies. Secondary antibodies conjugated to Alexa Fluor 647 
were obtained from Invitrogen. Images were collected on a DeltaVision system 
(Applied Precision). The mCherry-SH3₅ and eGFP-PRM₅ constructs were 
generated by cloning SH3₅ and PRM₅ into the vectors pmCherry-N1 and 
peGFP-N1, respectively. Images were processed using the program ImageJ 
(http://rsb.info.nih.gov/ij). 

In vitro FRAP assays. Fluorescence recovery after photobleaching (FRAP) 
experiments were performed on an LSM 510 confocal microscope (Zeiss) using 
a 488-nm laser line. The OG-SH3, or eGFP was bleached using ten iterative pulses 
(total time ~0.55 s) with full laser power. The droplets were >20 µm in diameter, 
and the bleached spots were 5 µm in diameter. The imaging power was 0.5% for 
OG-SH3, and 2% for eGFP. Images were processed using ImageJ. Background- 
corrected data were fit to a single exponential decay to yield the recovery time 
constant using GraphPad Prism 5 (GraphPad Software). 
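
A recovery time constant of the kind described above can be illustrated with a short Python sketch. The fits in this study were performed in GraphPad Prism; the sketch substitutes a simpler log-linear least-squares fit, which requires the plateau intensity to be supplied and is exact only for noise-free data. The model form I(t) = plateau - A·exp(-t/τ), the function name and the synthetic values are assumptions for illustration.

```python
import math


def fit_recovery_tau(times, intensities, plateau):
    """Estimate tau for I(t) = plateau - A * exp(-t / tau).

    Uses a least-squares line through ln(plateau - I) versus t, so the
    plateau must be supplied and every intensity must lie below it.
    """
    ys = [math.log(plateau - i) for i in intensities]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(ys) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, ys))
    sxx = sum((t - mean_t) ** 2 for t in times)
    slope = sxy / sxx  # equals -1 / tau for an ideal single exponential
    return -1.0 / slope


# Synthetic, noise-free recovery with tau = 5 s (hypothetical values).
true_tau = 5.0
times = [0.0, 1.0, 2.0, 4.0, 8.0]
intensities = [1.0 - 0.8 * math.exp(-t / true_tau) for t in times]
tau = fit_recovery_tau(times, intensities, plateau=1.0)
```

With noisy data, a nonlinear fit that also estimates the plateau, as a curve-fitting package provides, is the more robust choice.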

A petunia ABC protein controls strigolactone-dependent symbiotic signalling and branching 


Tobias Kretzschmar, Wouter Kohlen, Joelle Sasse, Lorenzo Borghi, Markus Schlegel, Julien B. Bachelier, Didier Reinhardt, Ralph Bours, Harro J. Bouwmeester & Enrico Martinoia 


Strigolactones were originally identified as stimulators of the 
germination of root-parasitic weeds' that pose a serious threat to 
resource-limited agriculture’. They are mostly exuded from roots 
and function as signalling compounds in the initiation of arbuscular 
mycorrhizae’, which are plant-fungus symbionts with a global effect 
on carbon and phosphate cycling*. Recently, strigolactones were 
established to be phytohormones that regulate plant shoot architec- 
ture by inhibiting the outgrowth of axillary buds**. Despite their 
importance, it is not known how strigolactones are transported. 
ATP-binding cassette (ABC) transporters, however, are known to 
have functions in phytohormone translocation’’. Here we show 
that the Petunia hybrida ABC transporter PDR1 has a key role in 
regulating the development of arbuscular mycorrhizae and axillary 
branches, by functioning as a cellular strigolactone exporter. 
P. hybrida pdr1 mutants are defective in strigolactone exudation 
from their roots, resulting in reduced symbiotic interactions. Above 
ground, pdr1 mutants have an enhanced branching phenotype, which 
is indicative of impaired strigolactone allocation. Overexpression of 
Petunia axillaris PDR1 in Arabidopsis thaliana results in increased 
tolerance to high concentrations of a synthetic strigolactone, 
consistent with increased export of strigolactones from the roots. 
PDR1 is the first known component in strigolactone transport, 
providing new opportunities for investigating and manipulating 
strigolactone-dependent processes. 

Strigolactones are a new class of carotenoid-derived"® phytohormone 
in land plants. In addition to their role in shoot branching, strigolactones 
are exuded into the rhizosphere under phosphorus-limiting conditions° 
and act as growth stimulants of arbuscular mycorrhizal fungi*. To 
identify efflux carriers of arbuscular-mycorrhiza-promoting factors such 
as strigolactones, we used a degenerate primer approach (Supplementary 
Fig. 2a) to isolate full-size PDR-type transporters (also known as ABC 
subtype G (ABCG) transporters) of P. hybrida that are abundant in 
phosphate-starved or mycorrhizal roots. The rationale behind the focus 
on these transporters, of which there are 15 in Arabidopsis"', 23 in Oryza 
sativa (rice)* and 23 putative factors in Solanum lycopersicum (tomato) 
(Supplementary Fig. 3a), was that they are plasma membrane proteins 
often found in roots’, they are implicated in below-ground plant- 
microbe interactions'*"*, and they have affinities for compounds that 
are structurally related to strigolactones*”"*. Of six primary candidates, 
only P. hybrida PDR1 had increased expression in roots that were 
subjected to either phosphate starvation (Fig. 1a) or colonization by 
the arbuscular mycorrhizal fungus Glomus intraradices (Fig. 1b). 
Furthermore, PDR1 transcript levels increased in response to treatment 
with the synthetic strigolactone analogue GR24 or the auxin analogue 
1-naphthaleneacetic acid (NAA) (Fig. 1c). Auxin has been shown to 
upregulate strigolactone-biosynthesis genes’ and to be involved in 
pre-symbiotic and early mycorrhizal events’”. 

P. hybrida PDR1 is predicted to encode a full-size PDR cluster I 
protein (GenBank accession number JQ292813; Supplementary Figs 
2b-c and 3b). The closest Arabidopsis homologue, ABCG40 (also 
known as PDR12), transports abscisic acid (ABA). However, in contrast 
to Arabidopsis ABCG40, P. hybrida PDR1 is not regulated by ABA 
(Fig. 1c). A 1.8-kilobase (kb) element upstream of PDR1 (GenBank 

Figure 1 | Below-ground PDR1 expression and PDR1 localization. 
a-c, Quantitative PCR for PDR1 in W115 roots: in the presence or absence of 
phosphate (P) (a); at 2-4 weeks post inoculation (w.p.i.) with G. intraradices 
(b); and in response to water (H₂O), GR24, NAA or ABA (c). Data are 
presented as mean ± s.e.m. (n = 3, where n denotes the number of samples). 
d-h, The pPhPDR1::GUS signal in W115 roots without treatment (d, e) (black 
arrowheads indicate epidermal cells); under phosphate-sufficient and 
phosphate-deficient conditions (f); in response to mycorrhization (+MYC, 
8 w.p.i.) (g); and in mycorrhized roots (8 w.p.i.) costained with black ink 
(h). Scale bars, 1 mm (d, f-h) and 0.1 mm (e). i, j, Transient CaMV 
35S::GFP::gPaPDR1 expression in Arabidopsis mesophyll protoplasts. The 
GFP::gPaPDR1 signal (i, top) and the corresponding transmission electron 
micrograph (i, bottom) and the free GFP signal (j, top) and transmission image 
(j, bottom) are shown. Scale bars, 10 µm. k-m, pPhPDR1::GUS signal co- 
localization with trypan-blue-stained root HPCs. Shown are a magenta GUS- 
stained root section (k), an additional trypan blue stain of the same sample 
(l) and stained wild-type root HPCs (m). Scale bars, 0.1 mm. 

accession number JQ292814) was fused to the GUS reporter and stably 
transformed into the P. hybrida cultivar W115. Below ground, 
pPhPDR1::GUS (promoter P. hybrida PDR1-GUS construct) expression 
was substantial in individual subepidermal cells of the lateral roots 
(Fig. 1d, e). These cells were largely identical to hypodermal passage 
cells (HPCs) (Fig. 1k-m), which are devoid of suberin and serve as 
cortical entry points for arbuscular mycorrhizal hyphae’*. GUS stain- 
ing was more prominent in roots grown under phosphate-deficient 
conditions (Fig. 1f) and in mycorrhizal roots, particularly in regions 
containing or flanking fully developed arbuscular mycorrhizal struc- 
tures (Fig. 1g, h). These results suggested a role for PDR1 in arbuscular 
mycorrhiza during pre-symbiotic development and during intraradical 
colonization. To assess the localization of the PDR1 protein, the genomic 
PDR1 orthologue from the P. hybrida progenitor P. axillaris (which 
has 99.7% amino acid identity to P. hybrida PDR1; GenBank accession 
number JQ292812; Supplementary Fig. 2b) was fused to the carboxy- 
terminus-encoding region of the green fluorescent protein (GFP) gene 
(GFP::gPaPDR1). Transient expression of GFP::gPaPDR1 in Arabidopsis 
showed that PDR1 localizes to the plasma membrane (Fig. 1i, j), con- 
sistent with a role in secretion. 

For functional analysis, we screened the dTph1 transposon-containing 
P. hybrida line W138 for insertional pdr1 mutants. A PCR-based DNA 
library screen of 1,000 individuals led to the identification of a dTph1 
insertion in exon 4 of P. hybrida PDR1 (Supplementary Fig. 4a-d), 
together with a footprint allele causing a frame shift (Supplementary 
Fig. 4e). Insertion of dTph1 in the coding region of a gene frequently 
results in a complete loss of function”. 

The W138 line with the mutant pdr1 gene (W138-pdr1) was com- 
pared directly with W138 and was crossed with P. hybrida W115 for 
further segregation analysis. Five homozygous pdr1 mutant lines 
(W115 X W138-pdr1) and wild-type lines (W115 X W138) were 
derived from the F, generation (Supplementary Fig. 4d). Phenotypes 
cosegregated with the mutation in PDR1, and transposon display 
analysis did not uncover other cosegregating dTph1 insertions in the 
W115 X W138 lines (Supplementary Fig. 4f), suggesting that dTph1 
insertion in PDR1 is responsible for the observed phenotypes. In 
addition, P. hybrida PDR1 knockdown lines (pdr1-RNAi), which were 
created in W115 by using two independent RNA interference (RNAi) 
constructs (Supplementary Fig. 5a, b), showed similar phenotypes. 

W138-pdr1 had a significantly reduced ability to accommodate 
Gigaspora margarita and G. intraradices (Fig. 2a), two distantly related 
arbuscular mycorrhizal fungi with different growth strategies*. This 
finding indicated that PDR1 functions as a transporter of a stimulatory 
molecule involved in symbiosis with diverse arbuscular mycorrhizal 
fungal species. Indeed, root exudates from W138-pdr1 showed 
reduced activity for stimulating the hyphal branching of G. margarita 
in an in vitro bioassay (Fig. 2b). Because these results suggested an 
involvement of strigolactones, W115 X W138-pdr1 root exudates 
were assessed for their ability to stimulate the germination of the 
root-parasitic weed Phelipanche ramosa (Orobanchaceae). As a con- 
trol, root exudates of dad1 mutants from the P. hybrida cultivar V26 
were used. DAD1 encodes carotenoid cleavage dioxygenase 8 
(CCD8)”° and is an orthologue of the established strigolactone- 
biosynthesis genes RMS1, MAX4 and D10 in Pisum sativum (pea), 
Arabidopsis and rice, respectively”. The germination rate of P. ramosa 
was significantly lower in the presence of root exudates from W115 X 
W138-pdr1 or dad1 mutants than root exudates from the correspond- 
ing wild-type plants, which induced germination to a similar extent to 
GR24 (Fig. 2d). Comparable results were obtained with pdr1-RNAi 
lines (Supplementary Fig. 5c). When inoculated with G. intraradices, 
W115 X W138-pdr1 lines had similarly retarded colonization rates to 
W138-pdr1 and dad1 mutants (Fig. 2c). 

Despite the delay in arbuscular mycorrhizal development, neither 
W115 X W138-pdr1 nor dad1 mutants had any morphological aberra- 
tions in intracellular mycorrhizal structures (Fig. 2e-h). Intraradical 
hyphae and arbuscules appeared normal, suggesting that the quantitative 

Figure 2 | Below-ground pdr1 phenotypes. a, Arbuscular mycorrhizal 
colonization of W138 and W138-pdr1 roots 8 w.p.i. with two arbuscular 
mycorrhizal fungi. Data are presented as mean ± s.e.m. (n > 20). b, In vitro 
branching response of G. margarita 24 h after application of GR24 (n = 5), root 
exudates from W138 (n = 17), root exudates from W138-pdr1 (n = 36) or 10% 
acetone (solvent, n = 5). Data are presented as mean ± s.e.m. c, Kinetics of G. 
intraradices colonization of W115 X W138 (n = 25), W115 X W138-pdr1 
(n = 25), V26 (n = 5) and dad1 mutants (n = 5). Data are presented as 
mean ± s.e.m. (P < 0.001 for all time points between mutants and wild types). 
d, P. ramosa germination induced by GR24 (n = 3), root exudates from W115 
X W138 (WXW WT, n = 10), W115 X W138-pdr1 (WXW pdr, n = 10), 
V26 (n = 4), dad1 mutants (n = 4) or water (H₂O, n = 4). Data are presented as 
mean ± s.e.m. -C, negative control. e-h, G. intraradices intracellular 
arbuscular mycorrhizal morphology 4 w.p.i. in W115 X W138 (e), W115 X 
W138-pdr1 (f), V26 (g) and dad1 mutants (h). Scale bars, 20 µm. a-d, *, 
P < 0.05; ***, P < 0.001. 


differences in colonization were due to a decreased number of 
hyphal penetrations and retarded intraradical expansion of arbuscular 
mycorrhizal fungal colonies, rather than to defects in intracellular 
fungal development. Thus, the phenotype of dad1 and pdr1 mutants 
was distinct from that of arbuscular mycorrhizal mutants such as pam1 
(ref. 21), str1 (ref. 22) or SYM-pathway* mutants, which commonly 
have aberrant arbuscular mycorrhizal fungal structures. 

A detailed analysis of W138 root exudates resulted in the identifica- 
tion of the strigolactone orobanchol (Supplementary Fig. 6). The 
orobanchol levels in pdr1 plant root exudates were significantly lower 
than those in wild-type plant exudates (Fig. 3a), whereas the levels in 
root extracts were not affected (Fig. 3b), indicating that pdr1 mutants 
are not defective in strigolactone biosynthesis. Orobanchol was not 
detectable in root exudates or in root extracts of dad1 plants, confirm- 
ing this mutant’s purported defect in strigolactone biosynthesis 
(Fig. 3a, b). The finding that only extraradical orobanchol levels were 
affected in pdr1 mutants indicated that PDR1 functions as a strigolactone 
export carrier. 

PDR1-dependent strigolactone transport was further explored in a 
heterologous system by stable and constitutive overexpression of 
GFP::gPaPDR1 in Arabidopsis Col-0, resulting in PDR1-OE lines (Sup- 
plementary Fig. 7a, b). Arabidopsis does not form arbuscular 
mycorrhizae and exudes only minuscule quantities of strigolactones”*. 
When grown on GR24-containing medium, PDR1-OE lines proved 

Figure 3 | Orobanchol content and PDR1-dependent GR24 tolerance and 
transport. a, b, Orobanchol in the root exudates (a) and extracts (b) of pdr1 
lines, dad1 mutants and wild-type plants (n = 9). Data are presented as 
mean ± s.e.m. FW, fresh weight. c, Col-0 and PDR1-OE lines grown on 0, 10 
and 25 µM GR24. Scale bar, 1 cm. d, GR24 export assay of [³H]GR24-preloaded 
roots of Col-0, vector control (VC) and PDR1-OE lines (OE L1-3). Relative 
[³H]GR24 in the medium (water), root and shoot after 1 h incubation at 4 °C 
and 23 °C. Data are presented as mean ± s.e.m. (n = 8). a, b, d, *, P < 0.05; ***, 
P < 0.001. 


more tolerant to the deleterious effects of high strigolactone concentrations 
on root elongation™ than wild-type plants (Fig. 3c and 
Supplementary Fig. 7c). Direct strigolactone exudation was assessed 
by quantifying the efflux of preloaded [³H]GR24 from roots either 
incubated at 4 °C, to monitor passive diffusion, or at 23 °C, allowing 
transporter-mediated efflux. After 1h, PDR1-OE roots incubated at 
23 °C retained significantly less GR24 than PDR1-OE roots incubated 
at 4°C (Fig. 3d and Supplementary Fig. 7d). In agreement with this 
observation, more GR24 was found in the root exudates of PDR1-OE 
lines at 23 °C. No significant differences were found between the root 
extracts and root exudates of wild-type or vector control lines in either 
condition (Fig. 3d). These results, together with the observed GR24- 
resistance phenotype of PDR1-OE plants, are best explained by PDR1 
acting as a strigolactone exporter. 

Taken together, our data suggest a role for PDR1 in strigolactone 
secretion from HPCs. We propose that PDR1-mediated strigolactone 
exudation under low phosphate conditions creates local rhizosphere 
gradients that guide arbuscular mycorrhizal hyphae to HPCs, which 
are susceptible to hyphal penetration (Supplementary Fig. 1), thereby 
initiating arbuscular mycorrhizae. The symbiotic phenotype of pdr1 
and dad1 mutants, and the induction of PDR1 expression in colonized 
root segments, suggests that strigolactones play an additional role in 
promoting sustained intercellular root colonization, whereas intracel- 
lular stages (for example, arbuscules) develop independently of strigo- 
lactones (Fig. 2e-h). 

Recently, it was demonstrated that strigolactones inhibit shoot 
branching’®. Although certain aspects of strigolactone biosynthesis 
and signalling have been unravelled”, information about the mode 
of strigolactone transport is scant. Strigolactones are mobile within 
the xylem sap*’, but it is not known how they are released from the 
producing cells and whether directed cell-to-cell transport occurs. 
Strigolactone biosynthesis is subject to direct negative feedback regu- 
lation’. Hence, strigolactone biosynthesis needs to be coordinated 
with export to prevent strigolactones from accumulating to levels that 
would restrict further production. Indeed, PDR1 expression was found 
to be stimulated by exogenous application of GR24 (Figs 1c and 4a), 
suggesting that this gene is induced in a substrate-dependent manner, 
as previously observed for other PDR subfamily”’* members. 

Above ground, PDR1 expression was largely confined to stem tissues, 
particularly the vasculature and nodal tissues adjacent to leaf axils 
(Fig. 4b, c). However, PDR1 expression was absent from dormant 
buds (Fig. 4d). This pattern is consistent with PDR1 functioning as 

Figure 4 | Above-ground PDR1 expression and pdr1-related branching 
phenotypes. a, b, Quantitative PCR for above-ground W115 GR24-treated 
tissue (a) and various organs (b). Data are presented as mean ± s.e.m. (n = 3). 
c, d, pPhPDR1::GUS in W115 at the four-leaf stage (c) and in the node 
(d) (white arrowhead, dormant axillary bud). The inset in c shows a 
magnification of the lower right corner of c. Scale bars, 10 mm (c) and 1 mm 
(d). e, f, Branch development 41 days post germination. Data are presented as 
mean ± s.e.m. Branch length is shown for W115 X W138 and W115 X W138-pdr1 
(n = 110) (e) and for W115 and two pdr1-RNAi lines, R104 (P < 0.001 for 
node 3-4) and C244 (P < 0.05 for node 3) (n = 8) (f). g, h, Effects of GR24 (+, 
striped bars) on branch development 34 days post germination at node 3-5 for 
W115 X W138 (WT) and W115 X W138-pdr1 (pdr1) (n = 24) (g) and for V26 
and dad1 mutants (n = 8) (h). Data are presented as mean ± s.e.m. a, b, e, h, *, 
P < 0.05; ***, P < 0.001. 

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a strigolactone transporter. Although strigolactones are mobile in the 
xylem”, a cellular transport system is required to deliver strigolactones 
to dormant buds that are not yet connected to the xylem. This idea is 
compatible with both current models of strigolactone-dependent 
branching control’’. 

According to the ‘second messenger model’, strigolactones are trans- 
ported to the bud as a second messenger of auxin”, hence cellular 
transport of strigolactones in the axillary regions would be indispensable. 
In the ‘auxin transport canalization-based model’, strigolactones are 
thought to dampen polar auxin transport, resulting in the accumula- 
tion of auxin to levels that inhibit bud outgrowth”’. Strigolactones 
could restrict auxin transport systemically and/or locally*’. For both 
models, local strigolactone transport capacity near the axils would be in 
line with the inhibitory role of strigolactones on branching. In W115 × 
W138-pdr1 plants, bud outgrowth was initiated sooner (Supplementary 
Fig. 8a) and more vigorously than in the wild-type plants, causing 
longer branches (Fig. 4e and Supplementary Fig. 8e-h) at nodes three to 
five. This early and vigorous bud outgrowth was also observed in pdr1-RNAi 
lines (Fig. 4f). The dad1 mutants initiate branches from all nodes 
(Supplementary Fig. 8d), and although branch elongation is retarded 
compared with the wild type, these mutants eventually produce full 
branches from every node**”®. At flowering time, this results in a 
phenotype that is more pronounced than in any of the pdr1 mutants, 
which have final branch patterns that differ only marginally from the 
respective wild-type plants (Supplementary Fig. 8b, c and Supplemen- 
tary Table 1). Nevertheless, the pdr1 mutant branching phenotype 
seems to be strigolactone-dependent and related to the dad1 mutant 
branching phenotype, because branch elongation in both mutants 
could be suppressed to near wild-type conditions by exogenous 
application of GR24 to the leaf axils (Fig. 4g, h). 

In conclusion, the identification of PDR1-mediated strigolactone 
transport contributes to a more comprehensive view of the modes of 
action of strigolactones. Understanding the underlying transport 
mechanisms is crucial for a holistic view of phytohormone function. 
Strigolactone transport has a direct effect on the phosphate-dependent 
control of arbuscular mycorrhizal levels and on the control of shoot 
branching, for which the integration of auxin and strigolactone signal- 
ling seems to be achieved partially through reciprocal transport modu- 
lation. The mild branching phenotype of pdr1 mutants relative to the 
strigolactone-biosynthesis mutant dad1 (ref. 30) suggests that residual 
transport and/or locally produced strigolactones may compensate for 
defective strigolactone transport in the shoot. However, arbuscular 
mycorrhizal development was affected to a similar degree in the pdr1 
and dad1 mutants, revealing that below-ground strigolactone transport 
and secretion rely primarily on PDR1. 


METHODS SUMMARY 


All experiments, with the exception of the transport assays, were performed in 
accordance with established protocols. Detailed methods and associated references 
can be found in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Plant growth conditions. Petunia lines were grown with a 16 h light cycle, at 60% 
relative humidity and 25 °C in soil (ED 73, Einheitserde) or in clay granules (Oil 
Dri US-Special, Damolin). Clay granules were supplemented once a week with 
half-strength Hoagland solution. For mycorrhization trials, a mix of 40% (v/v) soil, 
40% (v/v) clay granules, 10% (v/v) sand and 10% (v/v) mycorrhizal inoculum 
(Agrauxine) was used. Seeds were plated on medium containing 2.2 g l⁻¹ MS 
medium (Duchefa) and 15 g l⁻¹ sucrose, supplemented with 9 g l⁻¹ Phyto Agar 
(Duchefa), with a 16 h light cycle at 25 °C. Arabidopsis was grown in vertical plates 
on medium containing 2.2 g l⁻¹ MS medium and 15 g l⁻¹ sucrose, supplemented 
with 9 g l⁻¹ Phyto Agar, with a 16 h light cycle and 60% relative humidity at 21 °C. 
For hormone treatment, 14-day-old W115 seedlings grown on plates were exposed 
for 24 h to final concentrations of 1 µM or 10 µM of the synthetic strigolactone 
analogue GR24 (Chiralix), 10 µM ABA or 25 µM NAA. For phosphate starvation, 
14-day-old W115 seedlings were transferred to phosphate-free plates for 1 week. 
PDR1 cloning strategy. PDR-specific 0.5-kb transcripts were amplified from 
W115 root cDNA 5 weeks post inoculation (w.p.i.) with G. intraradices, using 
the following primers: 5’-mgwatgactctdytkytkggacctcc-3’ and 5’-gyttcytytgncche 
chgaaatwcc-3’ (5' region), or 5'-gggwaaracggwetyagtggwgcw-3’ and 5’-ctcatnaca 
atdgcwgcwgctctwgc-3’ (3' region). Sequences follow the universal degenerate code. 
The resultant 5’ and 3’ fragments were aligned, and the deduced consensus primers 
5'-TATTGGGACTTGAAATTTGTGCCGATAC-3’ and 5'-GCTCCACTAACAC 
CCATCAGAGCTGTC-3’ were used to amplify putative PDR fragments spanning 
2.5 kb. The 5’ and 3’ ends of P. hybrida PDR1 were amplified using a SMART 
RACE Amplification Kit (Clontech) with 5’ RACE (5'-CTCGAGTACATTTTCT 
CGGGGACCTTGG-3’), nested 5’ RACE (5'-CCATTTCGTCTCCAACAATG 
GTATCGG-3’), 3’ RACE (5'-GTCCTCAAGAGTAGGAAGCATCACTGCG-3’) 
and nested 3’ RACE (5’-ACCGAGGACCGGCTTGAACTCTTGAGAG-3’). The 
full-length P. hybrida PDR1 cDNA GenBank accession number is JQ292813. 

To obtain the full-length genomic sequence of PDR1, a P. axillaris BAC library 
(from C. Kuhlemeier) was screened with 5’-TGCCAATCCTTCATGATGTCA 
GTGG-3' and 5'-CCTTCTCTCTCCTAGACAGCTCTGC-3’. P. axillaris is a 
progenitor of the hybrid species P. hybrida*', and a genomic library for BAC-based 
cloning was available only for P. axillaris. BACs were extracted from candidate 
clones via the Large-Construct Kit (QIAGEN). Full-length genomic P. axillaris 
PDR1 was amplified from a BAC with 5’-AATTACTAGTATGGAGGGTG 
GTGAAG-3’ and 5'-AATTGCATGCCTATCTTTTCTGGAAATTAAATG-3’, 
cut with SpeI and SphI, and cloned into the vector pUC18-GFP5Tsp through 
compatible NheI and SphI restriction sites, for use in GFP localization studies”. 
For stable transformation, the CaMV 35S::GFP::gPaPDR1-terminator cassette was 
cloned from pUC18-GFP5Tsp into the pGreenII 0179 vector* by using the following 
strategy. First, the CaMV 35S promoter from native pUC18-GFP5Tsp was 
cloned via XhoI and XmaI sites. Second, the terminator from the native pUC18-GFP5Tsp, 
including the upstream SphI site, was cleaved with NheI and SacI and 
inserted into CaMV 35S-pGreenII 0179 via compatible XbaI and SacI sites. Third, 
GFP::gPaPDR1 was cut and cloned via XmaI and SphI. Nine kilobases of the 
genomic P. axillaris PDR1 locus, including upstream and downstream sequence, 
were submitted to GenBank under accession number JQ292812. 
PDR1 promoter GUS construct and GUS staining assay. A 1.8-kb P. hybrida 
PDR1 promoter fragment was amplified using a GenomeWalker Universal Kit 
(Clontech) with 5’-AGTTGGAAGTTTCTCAAGTGCAGCCCA-3’ and the 
nested 5'-CCCTAAAGAGTTCTTCACCACCCTCCAT-3’ (GenBank accession 
number JQ292814). The fragment was cloned into the pGEM-T-Easy Vector 
system (Promega), reamplified with 5’-CATGAAGCTTGCACCCAGAAGAAG 
ATTAGGC-3’ and 5'-TCGATCTAGACACATTAAGAGGAAAGTAGGTAC-3’ 
and cloned into the pGPTV-BAR™ vector system via HindIII and XbaI. Of the 
original T0 transformants, eight lines were selected for further analysis. Segregating 
T1 individuals of all eight lines had comparable below-ground and above-ground 
expression patterns at different developmental stages. Two of these lines were 
chosen for the in-depth analysis presented in this work, and all data presented were 
confirmed in both lines. 

GUS-staining trials were performed as described previously*’. After staining, 
samples were cleared for 24 h in 10% (w/v) KOH and stored in 70% (v/v) 
ethanol. For the analysis of HPCs, 5-bromo-6-chloro-3-indolyl β-D-glucuronide 
cyclohexylammonium salt was used for GUS staining, and samples were cleared for 
24 h in 10% (w/v) KOH before staining with trypan blue as described previously’*. 
PDR1 RNAi constructs. Silencing of P. hybrida PDR1-specific transcripts was 
performed with the pKANNIBAL vector system**. Two constructs were designed: 
one targeting a highly variable region within nucleotide binding domain 2 (NBD2) 
of PDR1 (the C construct), and the other targeting part of the 3’ end and the 3’ 
untranslated region of PDR1 (the R construct). The 148-base-pair (bp) C fragment 
(5'-GGAACGCAAGCAAAAGGGGTGAGGTTATTGAACTATCTTCGCTTGG 
AAAGAGCTCTTCTGAAAAAGGAAATGATGTTCGGCGAAGTGCATCTTC 
CAGGTCAATGTCCTCAAGAGTAGGAAGCATCACTGCGGCTGATTTGAG 
CAAGAG-3') was amplified from W115 cDNA with 5’-CGATGGATCCTCGA 
GGGAACGCAAGCAAAAGGG-3’, containing BamHI and XhoI restriction sites, 
and with 5'-CGATATCGATGGTACCCTCTTGCTCAAATCAGCCGCAGTGA-3’, 
containing ClaI and KpnI sites. The 411-bp R fragment (5'-GACATTATATG 
GACTAATTGCCTCACAATTTGGAGACATACAAGACAGACTTGACACAA 
ATGAGACAGTGGAACAATTCATAGAGAATTTCTTTGATTTCAAACATG 
ATTTTGTGGGATATGTTGCTCTCATTCTTGTTGGGATTTCTGTTCTTTT 
TCTCTTCATTTTTGCATTTTCAATTAAAACATTTAATTTCCAGAAAAGA 
TAGGTTGGTCCAGGTATACACATGAAAAGAGCGTTTATCAAGATATGT 
GTATATTAGGATAATAATATAATCTTTCTTTTTCCTCTTTTTTACTTATT 
GTGGTTTTCTCAAGTTTGGAATAGATAGAACCAAAAGTCTGTACTCTG 
TATTTAAGAACAACTTTTGTACACATTGTTATGTATTGGAGAAGTTATG 
AGTATCTTTTG-3’) was amplified with 5’-CGATGGATCCTCGAGACATTA 
TATGGACTAATTGCC-3’, containing BamHI and XhoI restriction sites, and with 
5'-CGATATCGATGGTACCAAAAGATACTCATAACTTCTCC-3’, containing 
ClaI and KpnI sites. The resultant amplicons were cloned in the sense and 
antisense direction in the two multiple cloning sites of pKANNIBAL. The 
pKANNIBAL RNAi cassette was excised from the vector backbone via NotI and 
transferred into the binary pGreenII 0229 vector**. After stable transformation of 
W115, the extent of downregulation was estimated by semiquantitative PCR or 
PCR with reverse transcription. 

Plant transformation. W115 was transformed as described previously’. 
Construct insertion was confirmed by PCR on genomic DNA with the primers 
5'-ACGGTCCACATGCCGGTATATACGATG-3’ and 5’-GATGGCATTTGTA 
GGAGCCACCTTCC-3’, which target the CaMV 35S promoter, or with 5’-GAA 
TTGATCAGCGTTGGTGGGAAAGC-3’ and 5'-GGTAATGCGAGGTACGGT 
AGGAGTTG-3’, which target the GUS gene. 

Transient transformation of Arabidopsis Col-0 protoplasts was performed as 
described previously’. Arabidopsis plants were stably transformed as described 
previously **. The T0 generation was selected for hygromycin resistance. Plants of 
the T1 generation were tested for hygromycin resistance and GFP expression. 
Screening approach to identify transposon insertions in PDR1. A 3D-gDNA 
library (from T. Gerats) representing 10 × 10 × 10 (1,000) W138 individuals was 
screened for dTph1 insertions in P. hybrida PDR1 using a PCR-based method”. 
The entire genomic region of PDR1 was scanned in contiguous steps covering less 
than 1 kb, using the dTph1-specific primer 5’-GAATTCGCTCCGCCCCTG-3’ 
and a variety of **P-labelled gene-specific primers. The primer 5’-CCATTTCGTC 
TCCAACAATGGTATCGG-3’ yielded a positive result. Homozygosity PCRs 
were performed with transposon-flanking primers: 5’-tgccaatccttcatgatgtcagtgg-3' 
and 5’-ccttctctctcctagacagctctgc-3’. Homozygous dTph1 insertion alleles were 
crossed into W115, the progeny were selfed, and the resultant offspring were tested 
for homozygosity with the abovementioned primers. Transposon display analysis 
using six W115 × W138 and five W115 × W138-pdr1 lines was performed as 
described previously”. 

Mycorrhization trials. Subsets of mycorrhized roots were stained*’ and quan- 
tified for their level of colonization using the gridline intersect method’. Only 
roots with clear intraradical structures such as coiled cortical hyphae, arbuscules 
and vesicles were scored as positively mycorrhized. A minimum of 200 intersect- 
ing root fragments per sample were investigated microscopically for intraradical 
arbuscular mycorrhizal structures. For trials involving W115 × W138 lines, five 
individuals from five lines were analysed, and the data were presented as a pool of 
n = 25 (5 × 5). Double staining of the colonized roots with propidium iodide and 
wheat germ agglutinin coupled to fluorescein isothiocyanate (WGA-FITC, Sigma- 
Aldrich) was performed as described previously*’. 
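
The colonization score described above reduces to a simple proportion. The following Python sketch is illustrative only and is not the authors' procedure: the function name, the boolean encoding of each intersect and the example counts are assumptions, and only the 200-intersect minimum comes from the text.

```python
def percent_colonization(intersect_scores, min_intersects=200):
    """Return the percentage of scored intersects that are mycorrhized.

    intersect_scores: one boolean per root/gridline intersect, True when
    clear intraradical structures (hyphae, arbuscules, vesicles) were seen.
    min_intersects: minimum sample size; 200 follows the text above.
    """
    scores = list(intersect_scores)
    if len(scores) < min_intersects:
        raise ValueError("too few intersects scored for this sample")
    return 100.0 * sum(scores) / len(scores)


# Hypothetical sample: 50 colonized intersects out of 200 scored.
example = [True] * 50 + [False] * 150
print(percent_colonization(example))
```

Per-plant percentages obtained this way could then be pooled across individuals and lines, as in the n = 25 pool mentioned above.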

Hyphal branching bioassays. Branching assays were performed as described 
previously“ with preselected spores of G. margarita (Agrauxine). For the produc- 
tion of the root exudate concentrate, petunia lines were grown in clay granules and 
then transferred for 24 h to 0.1 l of a hydroponic solution containing 2 mM CaCl₂ 
and 2 mM K₂SO₄ and kept under constant aeration. The hydroponic solution was 
then passed through a Sep-Pak Classic C18 Cartridge (Waters) to adsorb hydro- 
phobic root exudates. Exudates were eluted from the column using 2 ml of acetone, 
and the eluent was dried over nitrogen. Dried exudates were redissolved in acetone 
and normalized according to root fresh weight. Exudate equivalents of 10 mg root 
fresh weight were used in each branching assay. 

Transport assays and GR24 tolerance assays. Arabidopsis seeds from three 
independent PDR1-OE lines were surface-sterilized with 1% (v/v) bleach and 
50% (v/v) ethanol and plated on media supplemented with 2.2 g l⁻¹ MS medium, 
1% (w/v) sucrose and 0, 10 or 25 µM GR24. After three days of stratification, plants 
were moved to a 16 h light cycle and selected for GFP fluorescence after 3 days of 
growth. Root length was determined with ImageJ 1.44 software 
(http://rsbweb.nih.gov/ij) after 7 days, and seedlings were moved to hygromycin-containing 
plates without sucrose to confirm the selection by GFP fluorescence. 
For transport experiments, seeds were sterilized in the same way and plated on 
hygromycin-containing medium. After 3 days of stratification and 3 days of 
growth, seedlings were checked for GFP fluorescence. After 7 days, GFP- and 
hygromycin-positive plants were transferred to medium supplemented with 
2.2 g l⁻¹ MS medium and 1% (w/v) sucrose and grown for another 7 days. 
Three PDR1-OE lines, Col-0 and a vector control line were incubated for 2 h in 
the dark at 4 °C with root tips submerged in 0.1% (w/v) Phyto Agar supplemented 
with 25 nM [³H]GR24 (specific activity 40 Ci mmol⁻¹; American Radiolabeled 
Chemicals). Subsequently, the plant roots were washed in ice-cold 1 mM CaCl₂ 
and incubated in 200 µl 0.1% Phyto Agar. For each line, 50% of the plants were kept 
for a further 1 h at 4 °C in the dark as a diffusion control, and the other 50% were 
shifted to 23 °C for 1 h to monitor transport. Subsequently, the shoot, root and 
Phyto Agar fractions were incubated for 30 min in 50 µl 24% (w/v) trichloroacetic 
acid at 23 °C. ³H counts were determined in 3 ml ULTIMA Gold LSC cocktail 
(Perkin Elmer) with a Liquid Scintillation Analyser Tri-Carb 2900TR (Packard 
BioScience). The number of disintegrations per minute was converted to a 
percentage for each fraction and normalized to tissue fresh weight. 
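
The final conversion step can be sketched in Python as follows. This is one possible reading of the sentence above, not the authors' script: the function names, the fraction labels and all numbers are hypothetical, and the order of operations (percentage of total counts first, then division by fresh weight for tissue fractions only) is an assumption.

```python
def fraction_percentages(dpm_by_fraction):
    """Express each fraction's d.p.m. as a percentage of the summed d.p.m."""
    total = sum(dpm_by_fraction.values())
    if total <= 0:
        raise ValueError("total d.p.m. must be positive")
    return {name: 100.0 * dpm / total for name, dpm in dpm_by_fraction.items()}


def per_fresh_weight(percentages, fresh_weight_mg):
    """Divide tissue percentages by fresh weight (mg).

    Only fractions listed in fresh_weight_mg are normalized, because the
    agar fraction has no tissue weight.
    """
    return {name: percentages[name] / fresh_weight_mg[name]
            for name in fresh_weight_mg}


# Hypothetical counts for one plant kept at 23 °C.
dpm = {"shoot": 100.0, "root": 300.0, "agar": 600.0}
pct = fraction_percentages(dpm)
print(per_fresh_weight(pct, {"shoot": 5.0, "root": 15.0}))
```

Under this reading, transport would be judged by comparing the normalized values of the 23 °C plants with those of the 4 °C diffusion controls from the same line.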
RNA isolation, cDNA synthesis, semiquantitative PCR and quantitative RT- 
PCR. RNA was isolated with an RNeasy Plant Mini Kit (QIAGEN). Reverse tran- 
scription of RNA to cDNA was performed with M-MLV Reverse Transcriptase 
(Promega) and a poly(T) primer (Promega). 

P. hybrida PDR1 expression was quantified semiquantitatively with the primers 
5'-GAAACTGTGGCCGAAAGG-3’ and 5’-GAGTTCAAGCCGGTCCT-3’ or 
5'-AAATGCTACTACAGTGCAG-3’ and 5'-CATATAATGTCCAGGAAATG 
GG-3'. Tubulin 1 (TUB) transcripts, partially amplified with 5’-CATTG 
GTCAAGCCGGTTATTC-3’ and 5’-ACCCTTGAAGACCAGTACAGT-3’, 
served as a housekeeping and loading control. 

Samples were diluted 1:30, and the diluted samples (4 µl) were added to each 
reaction well, serving as template for the reaction. Deionized water served as a 
negative control for amplification. PDR1 expression was quantified with the primers 
5'-CCTGAGGTTTACCAAATGGG-3’ and 5'-GATGGTATTGGATTGGAGCA-3’. 
Glyceraldehyde 3-phosphate dehydrogenase (GAPDH) expression was quantified 
with the primers 5’-GACTGGAGAGGTGGAAGAGC-3’ and 5’-CCGTTAAG 
AGCTGGGAGAAC-3’. GAPDH served as a housekeeping gene for normaliza- 
tion because it has been shown not to be regulated by hormonal treatments or 
mycorrhization (D. Reinhardt, personal communication). Final primer concentrations 
of 50, 100, 200 and 300 nM were tested for cDNA amplification and 
melting behaviour in the range of 60 °C to 95 °C. Because no differences were 
recorded, a concentration of 100 nM was chosen for further experiments. 
Primer efficiency was recorded with W115 root cDNA as a template in a 
dilution range of 1:1 to 1:512, resulting in 94.42% for P. hybrida PDR1 and 98.561% 
for GAPDH. These values were taken into account in the calculations. SYBR Green 
PCR Master Mix (Applied Biosystems) was added to the samples to a 20 µl final 
volume. For each sample, three technical replicates were pipetted. RT-PCR was 
performed on a 7500 Fast Real-Time PCR System (Applied Biosystems) with 7500 
Software version 2.0.4. For quantification, comparative CT (ΔΔCT) was chosen as 
the method”; the PCR run was divided into three parts: first, the hold stage (50 °C 
for 2 min and 95 °C for 10 min); second, the cycling stage (95 °C for 15 s and 60 °C 
for 1 min for 40 cycles); and third, the melt curve stage (95 °C for 15 s, 60 °C to 
95 °C over 1 min, 95 °C for 30 s, and 60 °C for 15 s). Relative differences were 
calculated as described previously****. Each experiment was performed for three 
biological replicates. 
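
How primer efficiencies enter a relative quantification can be illustrated with a short Python sketch. The exact calculation used here is the one given in the cited references; what follows is the widely used standard-curve estimate of efficiency combined with an efficiency-corrected ratio, shown as an assumption. The function names and the CT values are hypothetical; only the two efficiencies (94.42% and 98.561%) are taken from the text.

```python
import math


def slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def primer_efficiency(relative_quantities, ct_values):
    """Efficiency from a dilution series: E = 10^(-1/slope) - 1.

    The slope is that of CT against log10(relative template quantity);
    E = 1.0 corresponds to 100% (perfect doubling per cycle).
    """
    log_q = [math.log10(q) for q in relative_quantities]
    return 10.0 ** (-1.0 / slope(log_q, ct_values)) - 1.0


def relative_expression(eff_target, eff_ref, dct_target, dct_ref):
    """Efficiency-corrected ratio; each dCT is CT(control) - CT(treated)."""
    return (1.0 + eff_target) ** dct_target / (1.0 + eff_ref) ** dct_ref


# Idealized twofold series from 1:1 to 1:512 with one cycle per dilution.
quantities = [2.0 ** -i for i in range(10)]
cts = [20.0 + i for i in range(10)]
print(primer_efficiency(quantities, cts))

# Hypothetical CT shifts combined with the efficiencies reported above.
print(relative_expression(0.9442, 0.98561, dct_target=2.0, dct_ref=0.1))
```

With both efficiencies at exactly 100%, the ratio reduces to the familiar 2 raised to the power ΔΔCT.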

Isolation, identification and quantification of petunia strigolactones. Plants 
were grown in an X-Stream 20 aeroponic system (Nutriculture) as previously 
described for Medicago truncatula’’. From day 8 until day 12, exudates were 
collected and pooled, and root material was sampled. Samples were stored at 
−80 °C until analysis. P. hybrida root exudates and extracts were prepared and 
analysed by ultra performance liquid chromatography coupled to tandem mass 
spectrometry (UPLC-MS/MS) as previously described for Arabidopsis’. 
Orobanchol was provided by K. Yoneyama. For trials involving W115 × W138 
lines, three individuals from three lines were analysed, and the data are presented 
as a pool of n = 9 (3 × 3). 

P. ramosa germination bioassay. Germination assays with P. ramosa seeds were 
conducted as reported previously'’. Exudates were prepared as described above (in the 
subsection ‘Hyphal branching bioassays’). GR24 (0.1 or 1.0 nM) and demineralised 
water were included as positive and negative controls, respectively. P. ramosa seeds 
were provided by M. Vurro. For trials involving W115 × W138 lines, two individuals 
from five lines were analysed, and the data are presented as a pool of n = 10 (5 × 2). 


Trypan blue staining of HPCs. Trypan blue staining of HPCs in roots was 
performed as described previously’’. 

Axillary branching trials. For a comparative analysis of lateral branch production 
in wild-type and pdr1 backgrounds, plants were grown for 65 days in 0.55 l pots in 
soil as described above (in the subsection ‘Plant growth conditions’) and were 
watered daily. Branch development was monitored at different time points in a 
binomial fashion (yes/no) with respect to the following parameters: bud length 
>7 mm and full branch. Full branches were scored in accordance with a petunia branch 
definition**. Furthermore, branch length was scored as a continuous parameter. For 
trials involving W115 × W138, 22 individuals from 5 lines were analysed, and the 
data are presented as a pool of n = 110 (5 × 22). For branch length trials in response 
to GR24 treatments, three lines each of W115 × W138, W115 × W138-pdr1, V26 
and the dad1 mutant (from K. Snowden) were grown in soil as described above (in 
the subsection ‘Plant growth conditions’). From 25 to 40 days post germination, 
plants were treated three times each week with 0 µM or 10 µM GR24 as described 
previously®. For trials involving W115 × W138 lines, eight individuals from three 
lines were analysed, and the data are presented as a pool of n = 24 (3 × 8). 
Statistical analyses. Depending on the experimental set-ups and prerequisites, 
Student’s t-tests, Fisher’s exact tests or generalized linear models (GLMs) with 
quasi-binomial error structures were applied using R software. 
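
For yes/no scores such as those used in the branching trials, a two-sided Fisher's exact test on a 2 × 2 table can be sketched as below. The analyses reported here were run in R; this Python version only illustrates the underlying computation, and the function name and example counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability of x 'yes' scores in row 1 given the fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_value = sum(prob(x) for x in range(lo, hi + 1)
                  if prob(x) <= p_obs * (1 + 1e-7))
    return min(1.0, p_value)


# Hypothetical counts: plants with a full branch at one node (yes, no)
# in a mutant line versus its wild-type control.
print(fisher_exact_two_sided(20, 4, 9, 15))
```

The small relative tolerance in the comparison guards against floating-point ties between equally likely tables.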
Bioinformatics. Analysis of DNA fragments and vector constructs was performed 
using Vector NTI software (Invitrogen). The membrane topology of PDR1 was 
predicted using ConPredII”’. Phylogenetic analysis of PDR1 was performed using 
tools available at Phylogeny.fr (http://www.phylogeny.fr). Alignments were per- 
formed with MultAlin software (http://multalin.toulouse.inra.fr/multalin/). 
Robustness of data sets. All data sets presented were confirmed in at least two 
independent trials with similar set-ups and outcomes. For mycorrhization and 
branching trials, individual pots were randomized to reduce positional effects, and 
sample size was kept high to reduce background effects. 
Goblet cells deliver luminal antigen to CD103⁺ 
dendritic cells in the small intestine 
The intestinal immune system is exposed to a mixture of foreign 
antigens from diet, commensal flora and potential pathogens. 
Understanding how pathogen-specific immunity is elicited while 
avoiding inappropriate responses to the background of innocuous 
antigens is essential for understanding and treating intestinal 
infections and inflammatory diseases. The ingestion of protein 
antigen can induce oral tolerance, which is mediated in part by a 
subset of intestinal dendritic cells (DCs) that promote the development 
of regulatory T cells’. The lamina propria (LP) underlies the 
expansive single-cell absorptive villous epithelium and contains a 
large population of DCs (CD11c⁺ CD11b⁺ MHCII⁺ cells) comprising 
two predominant subsets: CD103⁺ CX3CR1⁻ DCs, which 
promote IgA production, imprint gut homing on lymphocytes and 
induce the development of regulatory T cells*°, and CD103⁻ 
CX3CR1⁺ DCs (with features of macrophages), which promote 
tumour necrosis factor-α (TNF-α) production, colitis, and the 
development of TH17 T cells*”'°. However, the mechanisms by 
which different intestinal LP-DC subsets capture luminal antigens 
in vivo remain largely unexplored. Using a minimally disruptive 
in vivo imaging approach we show that in the steady state, small 
intestine goblet cells (GCs) function as passages delivering low 
molecular weight soluble antigens from the intestinal lumen to 
underlying CD103⁺ LP-DCs. The preferential delivery of antigens 
to DCs with tolerogenic properties implies a key role for this GC 
function in intestinal immune homeostasis. 

We examined the in vivo antigen acquisition behaviour of intestinal 
LP-DCs in fluorescent DC-reporter mice using two-photon microscopy 
(Supplementary Fig. 1a). The intestine is imaged within the peritoneal 
cavity and images can be acquired from either the intact intestinal serosa 
or from the luminal surface through a small longitudinal incision in the 
intestine (Fig. 1a, b and Supplementary Movie 1, upper panels). The 
preparation is sufficiently stable to permit three-dimensional imaging of 
DC behaviour deep within intestinal tissues (Fig. 1c and Supplementary 
Movie 1, lower left panel) and preserves blood flow and epithelial barrier 
integrity for more than 4 h of continuous imaging. 

We assessed antigen distribution by two-photon microscopy fol- 
lowing the intraluminal injection of 10 kDa rhodamine dextran as a 
model antigen. Dextran coated the surface of the epithelium and filled 
the space between villi and crypts (Fig. 1a, b). In addition, we observed 
cylindrical dextran columns approximately 5 µm in diameter and 
about 20 µm long projecting through the villus epithelium and into 
the LP, when imaging from either serosal or luminal orientations 
(Fig. 1a-c, Supplementary Fig. 1b and Supplementary Movie 1, upper 
panels and lower right panel). Transepithelial dextran columns were 
common throughout the small intestine from duodenum to ileum, but 
did not cause a general disruption of the epithelial barrier as shown by 
the exclusion of dextran from the LP (Fig. 1a-c). We did not detect 
transepithelial dextran columns in the stomach, caecum, or colon, 
with the exception of the epithelium overlying the caecal patches 
(Supplementary Fig. 1c-e and Supplementary Movie 2). Confocal 
microscopy revealed that dextran columns were a subset of epithelial 
cells containing intracellular dextran, which had a continuous border of 
E-cadherin on their basolateral surface and were often in contact with 
yellow fluorescent protein-positive (YFP⁺) cells in the LP of CD11c-YFP 
reporter mice*’ (Fig. 1d, e and Supplementary Movie 4, right panel). 
Periodic acid—Schiff staining of mucin in sections of the small intestine 
(Fig. 2a) produced a goblet cell (GC) staining pattern similar in fre- 
quency, distribution and dimensions to the dextran columns identified 
by two-photon microscopy (Fig. 2b, c). Furthermore, in contrast to the 
acellular and impermeable discontinuities seen in the small intestine 
epithelium”’, dextran columns were associated with a nucleus (Fig. 2d, 
Supplementary Fig. 2a and Supplementary Movie 3). To determine if 
the dextran-filled cells were in fact GCs, sections of intestine from mice 
given lysine-fixable dextran were stained with antibodies to mucin 2 
(MUC2) and cytokeratin 18, which are both highly expressed by GCs”. 
Dextran columns showed near perfect colocalization with MUC2⁺ and 
cytokeratin 18⁺ epithelial cells displaying GC morphology (Fig. 2e, f). 
Therefore, we term this phenomenon ‘goblet-cell-associated antigen 
passages’ (GAPs). To address the possibility that GAPs are apoptotic 
GCs, we co-stained for various markers of apoptosis, including cleaved 
cytokeratin 18, cleaved caspase 3 and TdT-mediated dUTP nick end 
labelling (TUNEL; Supplementary Fig. 3a-i). In all cases, we found no 
association between apoptotic GCs and GAPs. Moreover, GAPs are 
distinct from villous M cells, because they did not colocalize with the 
M-cell marker glycoprotein 2 (GP2) (Fig. 2g)'*. The frequency and 
distribution of GAPs assessed by two-photon microscopy was similar 
in all strains of specific-pathogen-free (SPF) mice examined (Sup- 
plementary Fig. 2b-d), with a non-significant trend toward more 
GAPs detected in the terminal ileum (Supplementary Fig. 2h). GAPs 
were also evident in human jejunum resection specimens (Fig. 2h, i), 
suggesting that GAPs are a general phenomenon of the healthy small 
intestine. We examined the frequency of GAPs in C3H/HeJBir IL-10⁻/⁻ 
mice'4, which develop spontaneous intestinal inflammation with 
GC loss, and in germ-free mice that lack normal gut flora. The number 
of GAPs and GCs correlated strongly; GAPs and GCs were significantly 
more numerous in germ-free mice (Supplementary Fig. 2e, g) and 
significantly fewer in IL-10⁻/⁻ mice (Supplementary Fig. 2f, g). 
Previous studies have shown that LP-DCs can extend transepithelial 
dendrites (TEDs) between intestinal epithelial cells to sample luminal 
contents and microbiota'*'*. However, these studies used ex vivo and 
exteriorized tissue preparations that involved removing the luminal contents 
and mucus before imaging. Using our in vivo imaging preparation 
and confocal microscopy, we found that although LP-DCs probed the 
epithelium actively with their dendrites (Fig. 1c and Supplementary 
Movie 1, lower left panel, and Supplementary Movie 4), they did not 
extend TEDs into the intestinal lumen to capture fluorescent antigen in 
healthy mice (based on over 50 independent intravital imaging experiments 
examining all regions of the small intestine from the tip of the 
villi to the base of the crypts). To confirm that our imaging approach 
can readily detect TEDs, we imaged CD11c-YFP reporter mice 
challenged with Salmonella typhimurium, which has been shown to 
upregulate TED formation’’. Although rare LP-DC TEDs were 
observed in approximately 2% of villi, TEDs did not mediate the uptake 
of luminal dextran or beads on the basis of fluorescence colocalization 
(Supplementary Fig. 4a and Supplementary Movie 5, upper left panel). 

Figure 1 | Steady-state transepithelial delivery of luminal material in the 
mouse small intestine. a, b, Intestines from anaesthetized CD11c-YFP reporter 
mice injected intraluminally with 10 kDa dextran (red) imaged with intravital 
two-photon microscopy from the (a) serosal or (b) luminal surfaces. Optical 
sections at increasing depths (shown in µm) revealed dextran in the lumen 
(asterisks), the crypts (yellow arrows), on the epithelial surface (unfilled arrow) 
and columns of dextran that traversed the epithelium (white arrows). Dextran 
was generally excluded from the LP as identified by CD11c-YFP⁺ LP-DCs 
(green) below the DAPI-stained epithelial nuclei (blue in b). Scale bars, 100 µm. 
c, Time-lapse recording of an LP-DC dendrite (green, white arrow) making 
repeated contacts with a dextran column (red) crossing the epithelium (DAPI-stained 
nuclei, blue). Scale bar, 50 µm; time stamp shows min:s elapsed time from 
the start of imaging. d, Rendered confocal image of a CD11c-YFP⁺ DC (green) 
in contact with a dextran-filled epithelial cell (red). Panels show orthogonal 
views; contact is indicated by the white arrow. Scale bar, 5 µm. e, Confocal image 
of a dextran-containing cell (red) bordered by a continuous E-cadherin-positive 
(blue) surface (white arrow). Asterisk indicates the position of an optical section 
near the cell’s centre showing intracellular dextran. A CD11c-YFP⁺ DC (green, 
yellow arrow) is positioned near the base of the epithelium. Orthogonal 
projection (bottom panels); red channel removed (right panels). Scale bar, 5 µm. 

Next, we examined whether paracellular leak’? could serve as a 
major source of luminal antigen for small intestine LP-DCs. The 
intraluminal injection of 5 mg of 10 kDa dextran produced a faint 
‘feather’-like staining pattern between villous epithelial cells, consist- 
ent with paracellular leak (Supplementary Fig. 4b and Supplementary 
Movie 5, upper right panel). Time-lapse two-photon imaging showed 
that dextran collected at the base of the epithelium, but was flushed out 
of the villi efficiently during contraction and did not remain coloca- 
lized with LP-DCs even in areas of extensive paracellular leak 
(Supplementary Fig. 4b and Supplementary Movie 4, left panel, and 
Supplementary Movie 5, upper right panel). In addition, we did not 
detect paracellular leak around GAPs in fixed sections by confocal 
microscopy, despite the more permissive tight junctions of GCs”. 
However, we cannot exclude the possibility that LP-DCs capture low 
levels of antigen via paracellular leak, because this process might be 
below the level of detection of our imaging approach. 

In contrast to TEDs and paracellular leak, two-photon time-lapse 
imaging provided direct evidence that GAPs are a source of luminal 
antigen for LP-DCs. In addition to dextran (Fig. 3a), GAPs were cap- 
able of transporting protein antigens (Fig. 3b). Although most GAPs 
remained visible for the duration of our imaging experiments, they 
were a dynamic phenomenon (Supplementary Movie 5, lower panels). 
Moreover, the manner in which LP-DCs interacted with GAPs varied. 
In some cases, DCs made stable contacts and slowly collected antigen 
over several minutes (Fig. 3a and Supplementary Movie 5, lower left 
panel) whereas in others, DCs actively probed GAPs and captured 
clumps of antigen (Fig. 3b and Supplementary Movie 5, lower right 
panel). We assessed the molecular weight exclusion limit of GAPs and 
found that beads ranging from 0.02 to 1.0 µm in size did not enter 
GAPs (Supplementary Fig. 4c). In contrast, GAPs filled rapidly with 
10 kDa dextran (Supplementary Fig. 4d) and dextran colocalized with 
CD11c-YFP⁺ LP-DCs 2 h after intraluminal injection (Fig. 3a and 
Supplementary Movie 5, lower left panel). GAPs also filled with larger 
dextrans (Supplementary Fig. 4d); however, capture by LP-DCs was 
markedly reduced with 70 kDa dextran and undetectable with 
2,000 kDa dextran during our 4-h imaging window. 

The small intestine LP contains two prominent DC populations: 
CX3CR1⁻ CD103⁺ DCs with tolerogenic potential and CX3CR1⁺ 
CD103⁻ DCs that have features of macrophages and have been impli- 
cated in intestinal inflammation®’®. We visualized LP-DC subsets 
in vivo using dual-reporter mice created by crossing CD11c-YFP mice 
with CX3CR1-GFP knock-in mice (ref. 28) in which CD103⁻ and 
CD103⁺ LP-DCs can be distinguished by the presence or absence of 
CX3CR1-GFP expression, respectively (Fig. 3c and Supplementary Fig. 
5). We frequently observed CD11c-YFP⁺ CX3CR1-GFP⁻ LP-DCs 
sampling GAPs by two-photon microscopy, but this behaviour was 
rare in CD11c-YFP⁺ CX3CR1-GFP⁺ LP-DCs (Fig. 3c and 
Supplementary Fig. 6c); out of 50 total LP-DCs containing dextran, 
49 were CD11c-YFP⁺ CX3CR1-GFP⁻. Furthermore, when we directly 
observed antigen transfer from GAPs to DCs (20 out of 500 GAPs 
examined), GAPs delivered antigen exclusively to CD11c-YFP⁺ 
CX3CR1-GFP⁻ LP-DCs (Fig. 3d). Immunofluorescence microscopy 
confirmed that the cells interacting with GCs were CD103⁺ CD11c⁺ 
LP-DCs and not B220⁺ plasmacytoid DCs (pDCs) (Supplementary 
Fig. 6a). Moreover, flow cytometry showed that luminal antigen was 
captured preferentially by CD103⁺ DCs at a proportion of roughly 10:1 
over CD103⁻ DCs (Fig. 3e) and rarely colocalized with pDCs (Sup- 
plementary Fig. 6b), consistent with the sampling bias observed by two- 
photon imaging. Control experiments demonstrated that antigen 
uptake during cell isolation was negligible (Supplementary Fig. 7); thus, 
the flow cytometry results measure the in vivo antigen acquisition 
capacity of different LP-DC subsets. Interestingly, CD103⁺ LP-DCs, 
but not CD103⁻ LP-DCs, often stained positive for cytokeratin 18 
(Fig. 3f and g), which is highly expressed by villous GCs (Fig. 3h). 
Because cytokeratin 18 expression is undetectable in LP-DCs by quan- 
titative real-time PCR (Fig. 3h), this suggests that CD103⁺ LP-DCs 
interact selectively with GAPs and can capture GC-derived proteins. 

Figure 2 | GCs are associated with the transepithelial passage of luminal 
material. a, b, Periodic acid-Schiff (PAS) staining of GCs (a) and dextran 
columns (b) visualized by two-photon microscopy. c, PAS and dextran stained 
structures have similar morphology and dimensions. d, Dextran columns were 
often associated with a nucleus (white arrow). e, f, Dextran columns (Dex, red) 
colocalized with GC markers cytokeratin 18 (Cyt18, white) (e) and MUC2 
(f). g, Cytokeratin 18-positive cells (white arrows) did not colocalize with the 
M-cell marker GP2 (yellow arrows). h, i, Dextran columns were present in 
healthy human small intestine (h) and stained positive for cytokeratin 18 (i). Scale 
bars, 30 µm (a, b and h), 20 µm (d, e, g and i), and 10 µm (f). Error bars, s.d. 
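
The subset counts quoted in the text (49 of 50 dextran-containing LP-DCs were CD11c-YFP⁺ CX3CR1-GFP⁻; antigen transfer was directly observed at 20 of 500 GAPs) can be expressed as proportions with an approximate uncertainty. The following is a minimal sketch only: the Wilson score interval is an assumption chosen here for illustration and is not an analysis reported in the paper, and the function name is hypothetical.

```python
import math


def wilson_interval(successes: int, total: int, z: float = 1.96) -> tuple[float, float, float]:
    """Return (proportion, lower, upper) using the Wilson score interval.

    z = 1.96 corresponds to an approximate 95% interval (an illustrative choice).
    """
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return p, max(0.0, centre - half), min(1.0, centre + half)


# Counts taken from the text; the interval itself is illustrative, not a reported result.
for label, k, n in [
    ("dextran-containing LP-DCs that were CX3CR1-GFP negative", 49, 50),
    ("GAPs with directly observed antigen transfer", 20, 500),
]:
    p, lower, upper = wilson_interval(k, n)
    print(f"{label}: {p:.2%} (approx. 95% interval {lower:.2%} to {upper:.2%})")
```

Such a summary only restates the reported counts; it does not replace the statistical tests described in the Methods.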


In two-photon imaging experiments, LP-DCs captured fluorescent 
luminal ovalbumin (Ova) readily from GAPs (Fig. 4a and Supplemen- 
tary Movie 6), similarly to fluorescent dextran and BSA. To determine 
if LP-DCs could process and present luminal antigen, we administered 
Ova in vivo, then sorted LP-DCs and co-cultured them with OTI T 
cells”. T-cell proliferation was assessed on day 3 by both CFDA (dye) 
dilution (Fig. 4b, c) and by counting T cells after culture (Fig. 4d). Total 
LP-DC populations (CD45⁺, CD11c⁺, MHCII⁺) were capable of 
inducing modest OTI T-cell proliferation that was significantly greater 
than controls (Fig. 4b, d). CD103⁺ LP-DCs stimulated significant OTI 
T-cell proliferation, whereas CD103⁻ LP-DCs from the same mice did 
not (Fig. 4d). The failure of CD103⁻ LP-DCs to stimulate T cells was 
not due to a lack of intrinsic antigen presentation capacity, because they 
were capable of inducing levels of OTI T-cell proliferation comparable 
to those induced by CD103⁺ LP-DCs (81.4% and 88.2%, respectively) when Ova was 
added to the cell culture (Fig. 4c). These findings suggest that 
CD103⁺ LP-DCs have cross-presentation capacity, similar to their 
mesenteric lymph node counterparts’. 
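
Dye-dilution readouts such as the CFDA assay rest on the dye being shared roughly equally between daughter cells, so fluorescence approximately halves with each division. A minimal sketch of that relationship follows, assuming ideal halving and no dye loss; the function names and the example intensities are hypothetical and are not values from this study.

```python
import math


def divisions_from_dilution(undivided_mfi: float, observed_mfi: float) -> float:
    """Estimate the number of divisions from the fold-drop in dye fluorescence.

    Assumes each division halves the per-cell fluorescence (idealized).
    """
    if undivided_mfi <= 0 or observed_mfi <= 0:
        raise ValueError("fluorescence intensities must be positive")
    return math.log2(undivided_mfi / observed_mfi)


def expected_mfi(undivided_mfi: float, n_divisions: int) -> float:
    """Expected fluorescence after n divisions under the same halving assumption."""
    return undivided_mfi / (2 ** n_divisions)


# Hypothetical intensities in arbitrary units, for illustration only.
print(divisions_from_dilution(1000.0, 125.0))        # an 8-fold drop
print([expected_mfi(1000.0, n) for n in range(4)])   # peak positions for 0-3 divisions
```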

In some two-photon time-lapse recordings we observed GAPs form- 
ing or disappearing (Supplementary Movies 3 and 5, lower panels), 
suggesting that GAP formation could be related to changes in GC func- 
tion or secretion. To test whether GC secretion was associated with GAP 


Figure 3 | GAPs deliver soluble antigen to CD103⁺ LP-DCs in the steady 
state. a, b, Time-lapse two-photon imaging of model antigens (red) dextran 
(a) and bovine serum albumin (BSA) (b) delivered by GAPs to LP-DCs (green, 
CD11c-YFP). Antigen from GAPs colocalized with LP-DCs (yellow arrows) over 
time. c, Two-photon imaging of CD11c-YFP CX3CR1-GFP mice. GAPs (red) 
delivered antigen preferentially to CD103⁺ LP-DCs (CD11c-YFP⁺ CX3CR1- 
GFP⁻; green) over CD103⁻ LP-DCs (CD11c-YFP⁺ CX3CR1-GFP⁺; cyan). 
d, Enumeration of GAPs and GAPs delivering antigen to LP-DC subtypes in 
CD11c-YFP CX3CR1-GFP mice in two-photon recordings. e, Flow cytometry of 
LP cells showed that CD103⁺ DCs captured more luminal dextran than CD103⁻ 
DCs. FITC, fluorescein isothiocyanate. SSC, side scatter. f, Cytospins of sorted 
LP-DCs stained with DAPI (blue) and the GC marker cytokeratin 18 (red). 
g, Significantly more CD103⁺ LP-DCs than CD103⁻ LP-DCs stained cytokeratin 
18-positive per high-powered field (P < 0.001). h, Neither DC population 
expressed detectable cytokeratin 18 mRNA. Scale bars, 15 µm (a and b), 50 µm 
(c), 25 µm (f). Data in g were taken from nine or more high-powered fields 
(representing more than 150 cells). Data in h were obtained in triplicate and are 
representative of two experiments. Error bars, s.e.m. nd, not detected. 
formation, we administered the cholinergic agonist carbamylcholine 
(CCh) to stimulate GC secretion and imaged the small intestine by 
two-photon microscopy. After CCh administration, the frequency of 
GAPs increased markedly, as did the amount of luminal antigen 
colocalized with LP-DCs (Fig. 4e, f), whereas paracellular leaks or 
TEDs were unaffected during our imaging time-frame (Fig. 4f). 
Furthermore, total LP-DCs, CD103⁺ LP-DCs and CD103⁻ LP-DCs 
from CCh-treated mice showed significantly enhanced luminal antigen 
presentation in ex vivo T-cell proliferation assays (Fig. 4g and 
Supplementary Fig. 8), indicating that GC secretion is linked mechan- 
istically to GAP formation and luminal antigen delivery. 

We also examined whether the increased number of GAPs in germ- 
free mice (Fig. 4h and Supplementary Fig. 2e, g) correlated with increased 
luminal antigen presentation by LP-DCs. LP-DCs from germ-free mice 


348 | NATURE | VOL 483 | 15 MARCH 2012 


Figure 4 | GAPs are a source of luminal antigen for CD103⁺ LP-DCs in the 
steady state. a, Two-photon time-lapse images of a LP-DC (CD11c-YFP⁺, 
green) acquiring fluorescent Ova (yellow arrow) from a GAP (red) in the 
intestinal epithelium (DAPI-stained nuclei, blue). Scale bar, 15 µm; time stamp, 
min:s of elapsed time. b-d, LP-DC antigen presentation capacity in mice given 
luminal Ova or PBS assessed on day 3 by CFDA dilution (b, c) and by counting 
the number of T cells after co-culture with LP-DCs (d). c, CD103⁻ LP-DCs 
induced T-cell proliferation comparable to CD103⁺ LP-DCs when exogenous 
antigen was added to the in vitro cultures (blue histograms; PBS controls, red 
histograms). e, f, Two-photon imaging of GAPs and LP-DCs in CCh-treated 
mice. CCh increased the number of GAPs and the colocalization of 10 kDa 
dextran (red) with LP-DCs (green). f, Inset, LP-DCs capturing dextran (yellow 
arrowheads). g, Luminal antigen presentation by LP-DCs following CCh 
administration as compared to controls given luminal Ova. h, Germ-free mice 
had increased GAPs (10 kDa dextran, red; DAPI, blue). i, Luminal antigen 
presentation by LP-DCs from germ-free mice was significantly enhanced 
compared to LP-DCs from SPF-housed mice. CD103⁺ LP-DCs presented 
luminal antigen significantly better than CD103⁻ LP-DCs from germ-free 
mice. j, Two-photon imaging of intestines in M1KO mice. M1KO mice lacked 
GAPs (red, 10 kDa dextran) in the epithelium (blue, DAPI). k, Luminal antigen 
presentation by LP-DCs from M1KO mice was undetectable. Scale bar, 20 µm 
(a), 40 µm (f). ns, not significant. *P < 0.05, **P < 0.01, ***P < 0.001. Error 
bars, s.e.m. Data in d, g and k are representative of three or more independent 
experiments; data in i are representative of two independent experiments. In 
e, n = 15 or more images were obtained from three animals for each condition. 


given luminal Ova showed significantly increased antigen presentation 
capacity compared to LP-DCs from SPF mice. Moreover, luminal 
antigen delivery in germ-free mice favoured CD103⁺ LP-DCs over 
CD103⁻ LP-DCs (Fig. 4i), similar to observations in SPF mice. 

To test directly whether GAPs are a major pathway of antigen 
delivery to LP-DCs, we generated mice with an epithelial-cell-specific 
deletion in mouse atonal homologue 1 (Math1), a transcription factor 
required for the development of secretory cell lineages, including 
GCs”. Math1fl/fl Vil-Cre mice (M1KO) lacked small intestine GCs (Sup- 
plementary Fig. 9a-c) and in vivo two-photon imaging revealed a 
concomitant loss of GAPs, with the epithelium forming a tight barrier 
to luminal dextran (Fig. 4j and Supplementary Fig. 9d). Moreover, 
LP-DC populations isolated from M1KO mice given luminal Ova 
failed to stimulate OTI T cells above background levels (Fig. 4k and 
Supplementary Fig. 9e). The lack of antigen presentation by CD103⁺ 
LP-DCs from M1KO mice was not due to an intrinsic defect, because 
these LP-DCs were capable of inducing robust T-cell proliferation 
when exogenous Ova was added to the T-cell cultures (Supplemen- 
tary Fig. 9f). Taken together, these findings indicate that GAPs are a 
major mechanism for delivering luminal antigens to LP-DCs in the 
steady state. 

Understanding how the balance between tolerance and immunity is 
achieved at the intestinal mucosa is crucial for oral vaccine develop- 
ment and the treatment of chronic intestinal inflammatory diseases. 
This study identifies GAPs as a mechanism by which CD103⁺ LP-DCs 
can acquire innocuous antigens from the intestinal lumen in the steady 
state. How preferential antigen delivery is achieved is the focus of 
ongoing studies. Biased delivery could be a natural consequence of 
CD103⁺ LP-DCs sampling the epithelium more actively or perhaps 
being recruited selectively by chemoattractants released by or near 
GCs. GC deficiency or dysfunction in mice and humans has been 
linked to the development of intestinal inflammation®”’. Although 
this association has been attributed to the loss of mucins and other 
biologically active GC products, our findings suggest that GCs could 
play a key role in promoting intestinal immune homeostasis by deliver- 
ing luminal antigen to tolerogenic LP-DCs. 


METHODS SUMMARY 

Mice and human specimens. All mice were of a C57BL/6 background unless 
otherwise specified. Healthy human jejunal sections were obtained from bariatric 
surgeries and placed in PBS containing dextran for 1 h before imaging. Procedures and 
protocols were carried out in accordance with the institutional review board at 
Washington University School of Medicine. 
Two-photon microscopy. The intravital imaging preparation used in this study 
is similar to previously described methods'!’* with the following differences: 
imaging is performed with the tissue within the peritoneal cavity, faecal material 
is not scraped from the mucosal surface and in some experiments atropine (1 mg 
kg⁻¹) was injected subcutaneously to dampen peristaltic movement of the small 
intestine. At this dose atropine did not affect the formation of TEDs or the 
detection of GAPs. Model fluorescent antigens, dextran (2-5 mg), Ova (2 mg), 
BSA (2 mg) and FluoSpheres (1 ml undiluted) (all from Invitrogen) were injected 
into the intestinal lumen approximately 2 h before imaging. Human resection 
specimens were incubated in 10 µg ml⁻¹ of dextran at room temperature for 1 h 
before imaging. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Mice and human specimens. C57BL/6 mice, Rag1⁻/⁻ mice, Balb/c mice, IL-10⁻/⁻ 
knockout mice on the C3H/HeJBir.129 background", OTI T cell receptor trans- 
genic mice’, CX3CR1-GFP knock-in mice”, Math1fl/fl mice”? and Villin-Cre trans- 
genic mice” were purchased from The Jackson Laboratory. CD11c-YFP transgenic 
mice” were a gift from M. Nussenzweig and LysM-GFP mice”! were a gift from 
K. Ley. Gnotobiotic mice were obtained from the Washington University Digestive 
Disease Research Core Center murine models core. Animals, other than gnotobiotic 
mice, were housed in a specific-pathogen-free facility and fed routine chow diet. 
Animals were 8 to 16 weeks of age at the time of analysis. Intestines from mice 
receiving 250 rads of gamma irradiation and killed 6 h later served as positive 
controls for apoptosis markers. All mice were on a C57BL/6 background unless 
otherwise specified. Healthy human jejunal sections were obtained from bariatric 
surgeries and placed in PBS containing dextran for 1 h before imaging. Procedures and 
protocols were carried out in accordance with the institutional review board at 
Washington University School of Medicine. 

Intravital two-photon microscopy. The intravital imaging preparation used in 
this study is similar to previously described methods’*’* with the following differ- 
ences: the intestine is not exteriorized during imaging, faecal material is not 
scraped from the mucosal surface and in some experiments atropine (1 mg kg⁻¹) 
was injected subcutaneously to dampen peristaltic movement of the small intestine. 
Mice were anaesthetized with isoflurane and a small vertical incision was made in 
the abdominal wall to expose the peritoneal cavity and contents. The intestine is 
secured to the bottom of a glass coverslip on the upper chamber plate using a thin 
ring of Vetbond tissue adhesive (3M). Because the coverslip sits directly over the 
incision in the mouse abdomen the tissue remains in the peritoneal cavity for 
imaging. No additional manipulations were performed to image from the serosal 
surface. The imaging chamber was maintained at 37 °C using a dual-channel heat- 
ing system (Warner Instruments). To image from the luminal surface, a small 
longitudinal incision was made in the intestine taking care to avoid large blood 
vessels. Model fluorescent antigens, dextran (2-5 mg), Ova (2 mg), BSA (2 mg) and 
FluoSpheres (1 ml undiluted) (all from Invitrogen) were injected into the intestinal 
lumen ~2 h before imaging. Human resection specimens were incubated in 10 µg 
ml⁻¹ of dextran at room temperature for 1 h before imaging. Time-lapse imaging 
was performed with a custom-built two-photon microscope running ImageWarp 
acquisition software (A&B Software). For time-lapse imaging, we averaged 15 
video-rate frames (0.5 s per slice) during the acquisition. Each plane represents 
an image of 220 × 240 µm in the x and y dimensions. 21 to 31 sequential z-steps 
(2.5 µm each) were acquired to form a z-stack. In our experiments epithelial integ- 
rity was assessed by dextran and DAPI staining, which, in healthy tissue, demarcate 
the luminal surface of the epithelium (aside from GAPs) and show an ordered 
arrangement of DAPI-stained nuclei, respectively. In some experiments mice were 
given 10° Salmonella typhimurium strain ¥3716 orally 24-72 h before imaging. 
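
The acquisition parameters above (15 averaged video-rate frames per 0.5-s slice, 220 × 240 µm planes, 21 to 31 z-steps of 2.5 µm) imply a stack depth, a per-stack acquisition time and a raw frame rate. The sketch below works these out under stated assumptions: each z-step is taken to contribute one full step of depth, and stage-movement overhead between slices is ignored. The class and field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ZStack:
    """Geometry and timing of one z-stack, using the values given in the Methods."""
    n_steps: int                    # 21 to 31 sequential z-steps
    step_um: float = 2.5            # z-step size in micrometres
    seconds_per_slice: float = 0.5  # time spent averaging one slice
    frames_averaged: int = 15       # video-rate frames averaged per slice
    x_um: float = 220.0             # field of view, x
    y_um: float = 240.0             # field of view, y

    @property
    def depth_um(self) -> float:
        # Assumption: depth = number of steps x step size.
        return self.n_steps * self.step_um

    @property
    def acquisition_s(self) -> float:
        # Assumption: no overhead between slices.
        return self.n_steps * self.seconds_per_slice

    @property
    def raw_frame_rate_hz(self) -> float:
        return self.frames_averaged / self.seconds_per_slice


for n in (21, 31):
    s = ZStack(n)
    print(f"{n} steps: {s.depth_um} µm deep, {s.acquisition_s} s per stack, "
          f"{s.raw_frame_rate_hz:.0f} frames per second before averaging")
```

Under these assumptions a stack spans roughly 52.5 to 77.5 µm and takes about 10.5 to 15.5 s, which sets the time resolution of the time-lapse recordings.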

Flow cytometry, immunohistochemistry and confocal microscopy. Flow 
cytometry and the staining of intestine sections were performed as previously 
described**. Reagents used for flow cytometry include anti-CD11c, anti-CD45, 
anti-CD4, anti-MHCII, anti-CD3, anti-CD8a, anti-PDCA-1 (all from eBioscience), 
7-aminoactinomycin D (7-AAD), anti-CD103 and anti-B220 (both from BD 
Biosciences). Data were acquired with a FACScan cytometer (BD Biosciences) 
retrofitted with additional lasers. Data acquisition was performed using 
CellQuest (BD Biosciences) and Rainbow (Cytek) or FlowJo software (Tree 
Star). Data analysis was performed on a Macintosh computer running FlowJo 
software. 


To immobilize the lysine-fixable fluorescent dextrans, intestinal sections were 
treated with 2% paraformaldehyde immediately after two-photon imaging. 
Reagents used for immunohistochemistry include 4′,6-diamidino-2-phenylindole 
(DAPI; Sigma-Aldrich), anti-CD11c, anti-CD103, anti-E-cadherin (all from BD 
Biosciences), anti-cytokeratin 18 and anti-MUC2 (both from Abcam), anti-GP2 
(MBL International), anti-cleaved cytokeratin 18 (Enzo Life Sciences), and anti- 
cleaved caspase 3 (Cell Signaling Technology). The terminal deoxynucleotidyl 
transferase-mediated dUTP nick end labelling (TUNEL) assay (Trevigen) was 
performed as per the manufacturer’s recommendations. Confocal microscopy 
was performed using a Zeiss LSM510 Meta laser scanning confocal microscope 
(Carl Zeiss) equipped with a ×63, 1.4 numerical aperture Zeiss Plan Apochromat 
oil objective. Images were obtained using the Zeiss LSM510 software. 

T-cell proliferation assays. Mice were anaesthetized and the small intestine 
injected intraluminally with 2 mg of Ova (Sigma-Aldrich) dissolved in phosphate 
buffered saline (PBS), or PBS alone (controls) as in the two-photon imaging 
experiments. In some experiments mice received 341g of carbamylcholine 
(Sigma-Aldrich) subcutaneously 20 min after the administration of luminal 
Ova. Two hours after the administration of Ova, cell populations were isolated 
from the intestinal lamina propria as previously described” and sorted by flow 
cytometry into total LP-DC populations (7-AAD⁻ CD45⁺ MHCII⁺ CD11c⁺), 
CD103⁺ DC populations (7-AAD⁻ CD45⁺ MHCII⁺ CD11c⁺ CD103⁺), or 
CD103⁻ DC populations (7-AAD⁻ CD45⁺ MHCII⁺ CD11c⁺ CD103⁻). 
Sorted DC populations were cultured with sorted CFDA (Invitrogen)-labelled 
OTI splenic T cells at a ratio of 1:10 DC to T cells. As a positive control, 20 µg 
of Ova was added to cultures of DC populations isolated from mice receiving 
luminal PBS, unless otherwise stated. After 3 days cultures were evaluated for 
CFDA dilution and the number of T cells by flow cytometry and cell counting. 

Real-time PCR. RNA isolation, cDNA synthesis, standard curve construction and 
real-time PCR were performed as previously described”. The following primers 
were used: 18S, forward, 5′-CGGCTACCACATCCAAGGAA-3′ and reverse, 
5′-GCTGGAATTACCGCGGCT-3′; cytokeratin 18, forward, 
5′-CAGCCAGCGTCTATGCAGG-3′ and reverse, 5′-CTTTCTCGGTCTGGATTCCAC-3′. Small 
intestine epithelial cell populations were stained with cytokeratin 18 and 
AlexaFluor 488 (Invitrogen)-labelled lectin from Ulex europaeus (UEA-I; 
Sigma-Aldrich). Goblet cell populations were isolated by flow cytometric sorting 
as UEA-I⁺ cytokeratin 18⁺ cells. 
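
For readers who want to check the primer sequences listed above, basic properties such as length, GC content and reverse complement can be computed directly. This is a minimal sketch rather than a primer-design tool; the dictionary keys and function names are hypothetical, and the cytokeratin 18 forward primer is assumed to be one continuous sequence where the printed text breaks across a line.

```python
# Primer sequences as listed in the Methods (5' to 3').
PRIMERS = {
    "18S forward": "CGGCTACCACATCCAAGGAA",
    "18S reverse": "GCTGGAATTACCGCGGCT",
    # Assumption: the line break in the printed sequence does not hide missing bases.
    "cytokeratin 18 forward": "CAGCCAGCGTCTATGCAGG",
    "cytokeratin 18 reverse": "CTTTCTCGGTCTGGATTCCAC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G and T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
          f"reverse complement {reverse_complement(seq)}")
```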

Statistical analysis. Data were analysed using Student’s t test or one-way ANOVA 
with Tukey’s post test in GraphPad Prism (GraphPad 
Software). 
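
The test statistics named above can be illustrated with a short standard-library sketch. It is not the GraphPad Prism workflow used in the study: it computes only the pooled-variance Student's t statistic and the one-way ANOVA F statistic, omits P values and the Tukey post test (which require distribution functions from a statistics package), and runs on placeholder numbers that are hypothetical, not data from the paper.

```python
import math
from statistics import mean, variance


def student_t(a: list[float], b: list[float]) -> tuple[float, int]:
    """Two-sample Student's t statistic (equal variances assumed) and its degrees of freedom."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, na + nb - 2


def one_way_anova_f(*groups: list[float]) -> tuple[float, int, int]:
    """One-way ANOVA F statistic with between- and within-group degrees of freedom."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand = mean([x for g in groups for x in g])
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k)), k - 1, n - k


# Placeholder measurements for illustration only (not experimental data).
control = [1.0, 2.0, 3.0, 4.0, 5.0]
treated = [2.0, 4.0, 6.0, 8.0, 10.0]
t, df = student_t(control, treated)
f, df_between, df_within = one_way_anova_f(control, treated)
print(f"t = {t:.3f} (df = {df}); F = {f:.3f} (df = {df_between}, {df_within})")
```

With exactly two groups the F statistic equals the square of the t statistic, which gives a quick consistency check on either calculation.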
Free fatty acids provide an important energy source as nutrients, 
and act as signalling molecules in various cellular processes’. 
Several G-protein-coupled receptors have been identified as free- 
fatty-acid receptors important in physiology as well as in several 
diseases**°. GPR120 (also known as O3FAR1) functions as a 
receptor for unsaturated long-chain free fatty acids and has a critical 
role in various physiological homeostasis mechanisms such as 
adipogenesis, regulation of appetite and food preference’®'*’. 
Here we show that GPR120-deficient mice fed a high-fat diet 
develop obesity, glucose intolerance and fatty liver with decreased 
adipocyte differentiation and lipogenesis and enhanced hepatic 
lipogenesis. Insulin resistance in such mice is associated with 
reduced insulin signalling and enhanced inflammation in adipose 
tissue. In humans, we show that GPR120 expression in adipose tissue 
is significantly higher in obese individuals than in lean controls. 
GPR120 exon sequencing in obese subjects reveals a deleterious 
non-synonymous mutation (p.R270H) that inhibits GPR120 sig- 
nalling activity. Furthermore, the p.R270H variant increases the 
risk of obesity in European populations. Overall, this study demon- 
strates that the lipid sensor GPR120 has a key role in sensing dietary 
fat and, therefore, in the control of energy balance in both humans 
and rodents. 

To investigate the role of GPR120 in metabolism, we examined 
GPR120-deficient mice (Supplementary Fig. 1) with respect to lipogen- 
esis, glucose and energy homeostasis. On a normal diet containing 13% 
fat, the body weight was similar in both GPR120-deficient and wild- 
type mice. However, when 5-week-old GPR120-deficient mice were fed 
a high-fat diet (HFD) containing 60% fat, their body weight increase 
was ~ 10% higher than that of wild-type mice on a HFD (Fig. 1a). The 
difference in HFD-induced body weight gain between wild-type and 
GPR120-deficient mice was marked at ~8-10 weeks old and reached a 
plateau at 13 weeks old. To assess energy expenditure and substrate 
utilization, we next performed indirect calorimetry on wild-type and 
mutant mice on a HFD at 9-10 weeks old (Fig. 1b) and 15-16 weeks old 
(Supplementary Fig. 2a). The young GPR120-deficient mice showed 
decreased energy expenditure compared with the young wild-type 
mice, particularly during the light/inactive phase (Fig. 1b, left), whereas 
older mutant and wild-type mice showed no such difference 
(Supplementary Fig. 2a, left). The difference in energy expenditure 
between GPR120-deficient and wild-type mice was observed only in 
the light phase, indicating that the lack of the GPR120 receptor 
primarily affects basal metabolism, especially in young mice. The 
decreased energy expenditure might explain the difference we found 
in body weight gain between HFD-fed wild-type and mutant young 
mice. The lower values of respiratory quotient in mutant mice could be 
due to insufficient glucose utilization, probably as a result of the decreased 
insulin sensitivity. In all experiments, both groups of mice showed 
similar levels of locomotor activity (data not shown). 
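
The respiratory quotient discussed above is the ratio of carbon dioxide produced to oxygen consumed; values near 1.0 indicate predominantly carbohydrate oxidation and values near 0.7 predominantly fat oxidation. The sketch below computes it, together with an energy expenditure estimate from the abbreviated Weir equation. The paper does not state which equation was used for indirect calorimetry, so the Weir formula is an assumption made for illustration, and the gas-exchange values are hypothetical.

```python
def respiratory_quotient(vco2: float, vo2: float) -> float:
    """Respiratory quotient: CO2 produced divided by O2 consumed (same units)."""
    if vo2 <= 0:
        raise ValueError("vo2 must be positive")
    return vco2 / vo2


def weir_kcal_per_min(vo2_l_min: float, vco2_l_min: float) -> float:
    """Energy expenditure in kcal/min from the abbreviated Weir equation.

    Assumption: this equation is used here only as an example; the study
    does not specify how energy expenditure was calculated.
    """
    return 3.941 * vo2_l_min + 1.106 * vco2_l_min


# Hypothetical gas-exchange rates in litres per minute, for illustration only.
vo2, vco2 = 1.0, 0.7
print(f"RQ = {respiratory_quotient(vco2, vo2):.2f}")
print(f"Energy expenditure = {weir_kcal_per_min(vo2, vco2):.4f} kcal/min")
```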

White adipose tissue (WAT) and liver were substantially heavier in 
HFD-fed GPR120-deficient mice (Supplementary Fig. 2b). Plasma 
low- and high-density lipoprotein cholesterol levels were significantly 
higher in HFD-fed GPR120-deficient mice, and serum alanine amino- 
transferase levels were substantially increased, indicating abnormal 
cholesterol metabolism and liver function (Supplementary Table 1). 
Microcomputed tomography scanning revealed that 16-week-old 
GPR120-deficient mice stored much more fat than did wild type 
(Fig. 1c). A significant increase in adipocyte size in both epididymal 
(Fig. 1d) and subcutaneous (Supplementary Fig. 2c) fat was observed 
in GPR120-deficient mice. Furthermore, the expression of macro- 
phage marker genes (Cd11b (Itgam), Cd68 and F4/80 (Emr1)) and 
the number of F4/80-positive cells were markedly enhanced in epidi- 
dymal tissue from HFD-fed GPR120-deficient mice (Fig. 1e, f). 
Moreover, these mice showed liver steatosis and hepatic triglyceride 
content was markedly increased (Fig. 1g). Overall, HFD-induced obesity 
and hepatic steatosis were more severe in GPR120-deficient mice than in 
wild type. 
Obesity-associated insulin resistance was also more severe in 
GPR120-deficient mice. HFD-fed GPR120-deficient mice showed 
higher levels of fasting plasma glucose and insulin than did wild type, 
although these parameters were similar between the two groups on a 
normal diet (Fig. 2a). HFD-induced insulin resistance, as determined 
by an insulin tolerance test, was more prominent in GPR120-deficient 
mice than in wild type (Fig. 2b, left, and Supplementary Fig. 3a, b). 
A glucose tolerance test further revealed that these mice suffered 
from impaired glucose metabolism (Fig. 2b, right, and Supplementary 
Figure 1 | Obesity, hypertrophic adipocytes, accumulation of pro- 
inflammatory macrophages and hepatic steatosis in HFD-fed GPR120- 
deficient mice. a, Body weight changes of wild-type (WT) and GPR120- 
deficient mice fed a normal diet (ND) or a HFD (n = 36-47). b, Indirect 
calorimetry in HFD-fed mice. Energy expenditure and respiratory quotient 
(n = 4, 5). c, Representative cross-sectional images of wild-type and GPR120- 
deficient mice subjected to microcomputed tomography analysis of the in situ 
accumulation of fat. Fat depots are demarcated (green) for illustration. 
d, Haematoxylin and eosin (H&E)-stained epididymal WAT and mean area of 
adipocytes (n = 6). Scale bar, 100 µm. e, Relative expression of Cd11b, Cd68 and 
F4/80 messenger RNA in WAT (n = 6). a.u., arbitrary units. f, Representative 
images of epididymal WAT stained with anti-F4/80 antibody (arrows, F4/80- 
positive cells) and the number of F4/80⁺ cells (n = 6). Scale bar, 100 µm. g, Oil 
Red O-stained liver and hepatic triglyceride content after 24-h fasting (n = 13). 
Scale bar, 50 µm. All data represent mean ± s.e.m. *P < 0.05 and **P < 0.01 
versus the corresponding wild-type value. 


Fig. 3a, b). The level of plasma leptin was significantly higher in HFD- 
fed GPR120-deficient mice than in wild type (Supplementary Fig. 3c). 
However, there was no significant difference in terms of plasma 
adiponectin level or food intake between the two groups (Supplemen- 
tary Fig. 3d, e). HFD-fed GPR120-deficient mice showed a marked 
increase in the size of islets and KI67 (MKI67)-positive cells, suggesting 
adaptive enlargement of the β-cell mass in response to insulin resist- 
ance’”"* (Supplementary Fig. 3f, g). Moreover, we observed markedly 
reduced peripheral insulin sensitivity in tissues from HFD-fed GPR120- 
deficient mice (Fig. 2c). Insulin was shown to induce the phosphoryla- 
tion of AKT (AKT1) (on Ser 473) in WAT, liver and skeletal muscle, 
with similar intensities in wild-type and GPR120-deficient mice on a 
normal diet (Supplementary Fig. 3h). Consistent with the insulin res- 
istance reported above, HFD-fed GPR120-deficient mice showed loss of 
insulin-induced AKT phosphorylation in WAT and the liver. 

To determine the molecular basis of the metabolic changes in WAT 
and livers of GPR120-deficient mice, we performed gene expression 
Figure 2 | Impaired glucose metabolism, adipogenesis and lipogenesis in 
HFD-fed GPR120-deficient mice. a, Fasting blood glucose and serum insulin 
levels (n = 6-15). b, Plasma glucose during insulin tolerance test (ITT, left) and 
glucose tolerance test (GTT, right) (n = 12-14). c, Phosphorylation of AKT 
(Ser 473) in WAT, liver and skeletal muscle after 24-h fasting (n = 6, 7). 
NS, not significant. d, Relative mRNA expression of Fabp4 and Scd1 in WAT or 
Scd1 in liver (n = 6). e, Protein expression of IRβ, IRS1, IRS2, SCD1 and β-actin 
in WAT. f, Protein expression of IRS1, IRS2, SCD1 and β-actin in liver. 
g, Oil Red O-staining and triglyceride (TG) content of mouse embryonic 
fibroblast (MEF)-derived adipocytes. Scale bar, 50 µm. h, Relative mRNA 
expression in MEF-derived adipocytes (n = 6). i, The ratio of C18:1 to C18:0 in 
livers (n = 6-8). j, Non-esterified C16:1n7 palmitoleate in WAT and plasma 
(n = 4-7). k, The ratio of Scd1 mRNA expression in liver and WAT (n = 6, 7). 
l, The ratio of C16:1 to C16:0 in adipose tissues (n = 6-8). m, Hepatic Scd1 
mRNA expression in mice infused with vehicle or triglyceride:palmitoleate for 
6 h (n = 4, 5). All data represent mean ± s.e.m. *P < 0.05 and **P < 0.01 
versus the corresponding wild-type value. 
analyses. We identified approximately 700 differentially expressed 
genes in WAT between HFD-fed GPR120-deficient and wild-type 
mice (Supplementary Fig. 4a). Connectivity mapping of these genes 
showed that pathways relating to insulin signalling and adipocyte 
differentiation were depressed, whereas those related to inflammation 
were enhanced in HFD-fed GPR120-deficient mice (Supplementary 
Fig. 4b). Quantitative real-time PCR (qRT-PCR) analysis confirmed 
the downregulation of insulin-signalling-related genes (Insr, Irs1 and Irs2), 
an adipocyte differentiation marker gene (Fabp4) and a lipogenesis- 
related gene (Scd1) in the epididymal fat from HFD-fed GPR120- 
deficient mice (Fig. 2d and Supplementary Fig. 3i). We identified 
approximately 100 differentially expressed genes in the liver between 
HFD-fed GPR120-deficient and wild-type mice (Supplementary Fig. 5). 
Notably, lipogenesis-related genes (Scd1 and Me1) and a fatty acid 
transporter gene (Cd36) were significantly upregulated in livers from 
GPR120-deficient mice. Quantitative RT-PCR analysis confirmed 
upregulation of Scd1 in the liver of GPR120-deficient mice (Fig. 2d). 

Western blot analysis further confirmed downregulation of IRβ, 
IRS1 and SCD1 in adipose tissue of HFD-fed GPR120-deficient mice 
(Fig. 2e) but downregulation of IRS1 and IRS2 and upregulation of 
SCD1 in their livers (Fig. 2f). Hence, insulin-signalling-related molecules 
were downregulated by the lack of GPR120 in both adipose tissue and 
the liver. However, the expression of SCD1, the rate-limiting enzyme in 
the biosynthesis of mono-unsaturated fatty acids, was downregulated in 
adipose tissue but upregulated in liver. Furthermore, the expression of 
Scd1 and several adipogenic genes'*”” (Pparg, Fabp4 and Srebf1) was 
suppressed in mouse-embryonic-fibroblast-derived adipocytes from 
GPR120-deficient mice, indicating that GPR120 is required for normal 
adipogenesis, as previously reported in differentiating 3T3-L1 adipocytes 
depleted of endogenous GPR120 by short interfering RNA" (Fig. 2g, h). 

To determine the effects of altered lipogenesis on lipid composition 
in GPR120-deficient mice, we performed lipidomics analysis in WAT, 
livers and plasma. Significant changes of major lipid clusters were 
observed (Supplementary Fig. 6). Notably, the hepatic concentration 
of oleate (C18:1n9c) was significantly higher in HFD-fed GPR120- 
deficient mice than in wild type. The ratio of C18:1 to C18:0, an 
indicator of SCD1 enzyme activity, was markedly enhanced in 
livers from HFD-fed GPR120-deficient mice relative to wild type 
(Fig. 2i). Moreover, the levels of C16:1n7 palmitoleate, which has 
recently been recognized as a lipid hormone, in WAT and plasma 
were significantly lower in HFD-fed GPR120-deficient mice than in 
wild type (Fig. 2j). In particular, lower levels of C16:1n7 palmitoleate 
were detected even in WAT of GPR120-deficient mice on a normal 
diet (Fig. 2j), which seems to be in good agreement with the suppressed 
Scd1 expression and the reduced SCD1 desaturation index (C16:1/C16:0; 
Fig. 2k, l). Lipidomics analysis clearly illustrated dysregulated 
lipogenesis in GPR120-deficient mice, and showed the reduced production 
of lipid hormone C16:1n7 palmitoleate. To determine 
whether the enhanced hepatic lipogenesis in GPR120-deficient mice 
was due to the reduced levels of C16:1n7 palmitoleate, we examined 
the effect of C16:1n7 palmitoleate treatment on hepatic Scd1 expres- 
sion. A 6-h infusion of triglyceride:palmitoleate markedly lowered the 
enhanced hepatic Scd1 expression in GPR120-deficient mice (Fig. 2m). 
The results indicated that the reduced C16:1n7 palmitoleate may 
explain the systemic metabolic disorders observed in GPR120-deficient 
mice on a HFD, as palmitoleate has been proposed to be a 
bioactive lipid by which adipose tissue communicates with distant 
organs (such as liver) and regulates systemic metabolic homeostasis. 
This study shows that dysfunction of GPR120 can be an underlying 
mechanism for diet-associated obesity and obesity-related metabolic 
disorders in mouse. 

The mouse data prompted us to assess the potential contribution of 
GPR120 to the development of obesity and its metabolic complications 
in humans. First, the expression levels of GPR120 in both subcutaneous 
and omental adipose tissues as well as in liver samples were compared 
between lean and obese subjects. Normoglycaemic obese patients and 


352 | NATURE | VOL 483 | 15 MARCH 2012 


lean individuals (n = 14 in each group) were matched for age and 
gender (Supplementary Table 2). As previously described, we confirmed 
that GPR120 is barely expressed in liver of either lean or obese 
subjects (data not shown). By contrast, we found that GPR120 is well 
expressed in the adipose tissue of lean individuals (Fig. 3a). In addi- 
tion, human obesity is significantly associated with an increase in 
GPR120 expression in both subcutaneous and omental adipose tissues 
(1.8-fold increase; P = 0.0004 and P = 0.003, respectively). We also 
found that GPR120 expression in subcutaneous adipose tissue 
strongly correlates with that in omental adipose tissue (Spearman 
analysis; r = 0.570 and P = 2.74 X 10°), suggesting a systemic regu- 
lation of its expression in humans. Furthermore, we found a positive 
correlation between GPR120 expression in both subcutaneous and 
omental adipose tissues and in subjects’ concentrations of plasma 
low-density lipoproteins (on adjustment for age and sex; r = 0.247, 
P = 0.0115 and r = 0.255, P = 0.0118, respectively). 

To investigate the contribution of GPR120 to human obesity, the four 
GPR120 exons were sequenced in 312 French, non-consanguineous, 
extremely obese children and adults (Supplementary Table 3). We 


[Figure 3, panels a-e: graphic not recoverable from extraction. Legible labels: y axes "GPR120 ratio to GAPDH" and "RFU (10³)"; x axis "[ALA] (µM)"; series WT, R270H, R67C, WT/R270H and R270H/WT; treatments DMSO, PMA and ALA; anti-FLAG and anti-HA staining with Dox induction.]


Figure 3 | GPR120 expression in human obese tissue samples, and effect of 
GPR120 variants on [Ca2+]i response and GLP-1 secretion. a, GPR120 
mRNA levels in human subcutaneous (SC) and omental (OM) adipose tissues of 
lean (LN; n = 14) and obese (OB; n = 14) normoglycaemic individuals. Mann-Whitney 
analysis, ***P = 0.0004 and **P = 0.003. b, ALA-induced [Ca2+]i 
responses in cells expressing wild-type GPR120 or a p.R67C or p.R270H variant. 
c, ALA-induced GLP-1 secretion in NCI-H716 cells expressing a wild-type 
GPR120, a p.R67C or a p.R270H receptor. d, Effect of transfection with GPR120 
variants on ALA-induced [Ca2+]i response in cells stably expressing wild-type 
GPR120. e, Effect of co-expression of human GPR120 p.R270H variant with 
wild-type GPR120 on ALA-induced [Ca2+]i response. Top: schematic diagram 
of constructs. Bottom: expression of wild type and p.R270H (left), and 
concentration-[Ca2+]i response for ALA in cells expressing wild-type/wild-type, 
wild-type/R270H or R270H/wild-type receptors (right). **P < 0.01 versus 
the corresponding control value. RFI, relative fluorescence intensity; RFU, 
relative fluorescence unit. All data show mean ± s.e.m. 
Table 1 | Identified variants in GPR120 exons and association between the p.R67C/rs6186610 or p.R270H non-synonymous variant and obesity 

                                                                                               Adjustment: age and gender      Adjustment: age, gender and geography
Type        Variant           Nucleotide  Chr 10      MAF1    MAF2 (controls,  MAF2 (cases,    OR [95% CI]        P            OR [95% CI]        P
                              change      position            n = 7,654)       n = 6,942)
Missense    p.R67C/rs6186610  C→T         95,316,666  0.05    0.043            0.055           1.16 [1.02, 1.31]  0.022        1.13 [1.00, 1.28]  0.060
Missense    p.R270H           G→A         95,337,031  0.03    0.013            0.024           1.62 [1.31, 2.00]  8.00 × 10⁻⁶  1.58 [1.28, 1.95]  2.17 × 10⁻⁵
Synonymous  p.V38V            G→A         95,316,581  0.0016  -                -               -                  -            -                  -
Synonymous  p.S192S           G→A         95,325,846  0.0016  -                -               -                  -            -                  -
Synonymous  p.V243V           C→T         95,328,938  0.0016  -                -               -                  -            -                  -
Synonymous  p.S264S           G→A         95,337,014  0.0016  -                -               -                  -            -                  -

Variant position was indicated according to human genome build NCBI36/hg18. Association between p.R67C/rs6186610 or p.R270H variant and obesity was assessed by using a logistic regression adjusted for 
age and gender or for age, gender and geography, under an additive model. Chr, chromosome; MAF1, minor allele frequency in sequencing data set (n = 312 extremely obese individuals); MAF2, minor allele 
frequency in the large obesity case-control genotyping data set; OR, odds ratio; CI, confidence interval. 


identified only two non-synonymous variants, R270H (minor allele 
frequency (MAF), ~3%) and p.R67C/rs6186610 (MAF, ~5%), and 
four rare synonymous variants (MAF, <1%) (Table 1). The two non- 
synonymous variants were subsequently genotyped in 6,942 unrelated 
obese individuals and 7,654 control subjects, all of European origin 
(Supplementary Table 4). By using a logistic regression model adjusted 
for age and sex, we found that R270H associated with obesity under an 
additive model (odds ratio (OR) = 1.62, 95% confidence interval 
[1.31, 2.00], P = 8.00 × 10⁻⁶; Table 1), whereas we found 
only a trend for association between p.R67C and obesity (OR = 1.16, 
95% confidence interval [1.02, 1.31], P = 0.022; Table 1). It is noteworthy that these 
results were almost the same after adjusting for geographical origin 
(Table 1). 
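
As a rough consistency check on the odds ratios and intervals quoted above, a Wald z statistic and two-sided P value can be back-calculated from a reported odds ratio and its 95% confidence interval. The sketch below is illustrative only: the function name is hypothetical, it assumes the interval is symmetric on the log scale, and because the published values are rounded it can only be expected to land near the reported P values, not on them.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def wald_p_from_or_ci(odds_ratio, ci_low, ci_high):
    """Back-calculate (z, two-sided P) from an odds ratio and its 95% CI.

    Assumes a Wald interval, i.e. log(OR) +/- Z_95 * SE.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * Z_95)
    z = math.log(odds_ratio) / se
    # Two-sided tail probability of a standard normal variable.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


# Values as quoted in the text (age- and sex-adjusted model).
z_r270h, p_r270h = wald_p_from_or_ci(1.62, 1.31, 2.00)
z_r67c, p_r67c = wald_p_from_or_ci(1.16, 1.02, 1.31)
print(f"p.R270H: z = {z_r270h:.2f}, P = {p_r270h:.2e}")
print(f"p.R67C:  z = {z_r67c:.2f}, P = {p_r67c:.3f}")
```

The back-calculated values should agree with the reported ones only to within rounding of the published odds ratios and interval limits.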

We then genotyped the p.R270H variant in 1,109 French pedigrees 
selected for obesity (n = 5,045) and in 780 German trios with one 
obese child (n = 2,340). We observed a significant over-transmission 
of the p.R270H low-frequency variant to obese offspring in 117 
pedigrees or trios where the p.R270H variant was present (transmis- 
sion, 62%; P = 0.009; Supplementary Table 5). This family-based study 
excludes a hidden population stratification effect as a cause of spurious 
association. 

We assessed the functional significance of both the p.R67C mutation 
and the p.R270H mutation in silico using several programs: arginine 
residues at positions 67 and 270 presented a high-evolutionary- 
conservation pattern among mammals and the two amino-acid sub- 
stitutions were predicted to be potentially damaging (Supplementary 
Table 6). To examine the influences of the two non-synonymous variants 
on GPR120 function in vitro, we assessed each receptor's ability to 
mobilize intracellular calcium (concentration, [Ca2+]i) in response to 
the endogenous agonist α-linolenic acid (ALA). We found that ALA 
induced [Ca2+]i responses in T-REx 293 cells expressing either wild-type 
or p.R67C receptor in a dose-dependent manner, whereas 
ALA-induced [Ca2+]i responses in cells expressing p.R270H were significantly 
lower (P = 1.6 X 10°) than those in cells expressing wild 
type at ALA concentrations greater than 10 µM (Fig. 3b). We further 
examined the functional ability of the mutated receptors to secrete 
GLP-1 (ZGLP1) from human intestinal NCI-H716 cells, as this cell 
line lacks GPR120 expression and it can secrete GLP-1 in a regulated 
manner. ALA induced secretion of GLP-1 in NCI-H716 cells expressing 
either wild-type (P = 0.004) or p.R67C (P = 3.2 x 10 °) receptor, 
but not in NCI-H716 cells expressing p.R270H mutant (P = 0.96) 
(Fig. 3c). The transfection efficiencies for the GPR120 variant receptors 
were confirmed to be almost the same (data not shown). To examine 
the effect of the p.R270H variant on the wild-type receptor signalling, 
we analysed the [Ca2+]i dose-response curves after the transfection of 
an empty vector, a wild-type receptor plasmid or a p.R270H-mutated 
plasmid into T-REx 293 cells expressing wild-type GPR120. The trans- 
fection of the p.R270H-mutated plasmid suppressed dose-response 
curves, and the maximal ALA-induced [Ca2+]i response was significantly 
decreased (P = 0.004; Fig. 3d). 

To assess the effect more quantitatively, we analysed [Ca2+]i dose-response 
curves in T-REx 293 cells stably expressing bicistronic 
wild-type/wild-type, wild-type/p.R270H or p.R270H/wild-type receptors 


(Fig. 3e, top). Almost equal levels of receptor protein expression in each 
cell line were confirmed by flow cytometry analysis (Fig. 3e, bottom 
left). Compared with cells expressing wild-type/wild-type receptor, 
the [Ca2+]i dose-response curves obtained in cells expressing either 
wild-type/p.R270H or p.R270H/wild-type receptor were markedly 
suppressed, and the maximal ALA-induced [Ca2+]i response was significantly 
decreased (P= 1.2 X 10°; Fig. 3e, bottom right). These 
findings suggest that the p.R270H variant that is significantly associated 
with obesity has an inhibitory effect on GPR120. The 
p.R270H mutant lacks the ability to transduce the signal of long-chain 
free fatty acids, contrary to the p.R67C mutant, which did not associate 
with obesity. 

To analyse whether being a p.R270H variant carrier may affect 
GPR120 expression in the adipose tissue, we quantified GPR120 expres- 
sion in samples from both obese p.R270H carriers and obese non- 
carriers. Two hundred and thirty-eight obese normoglycaemic patients 
from the Atlas Biologique de l’Obésité Sévère cohort had already been 
genotyped for the p.R270H variant. Ten subjects heterozygous for the 
p.R270H variant were matched for age, gender and body mass index 
with ten non-carrier (wild-type) obese normoglycaemic patients (Supplementary 
Table 7). The expression of GPR120 was similar between 
p.R270H carriers and wild-type subjects, both in subcutaneous and 
omental adipose tissues (Supplementary Fig. 7a), suggesting that the 
presence of the functionally deleterious mutation has no primary or 
secondary effect on gene expression in fat depots. The adipogenesis 
marker PPARG, the lipogenesis-related factor SCD and the macrophage 
marker CD68 were found similarly well expressed in the adipose tissues 
of wild-type and p.R270H carrier patients (Supplementary Fig. 7b, c). 
Nevertheless, the expression of the fatty-acid-binding protein FABP4 
in omental adipose tissue was significantly lower in p.R270H carriers 
than in wild-type individuals (28% decrease, P = 0.043; Supplementary 
Fig. 7b). 

Our results provide convincing evidence that the lipid sensor GPR120 
is involved in obesity in both mice and humans. Given the role of 
GPR120 as a physiologic integrator of the environment (especially the 
fatty diet), these data provide insight into the molecular mechanisms by 
which the ‘Westernized’ diet may contribute to early-onset obesity and 
associated complications including non-alcoholic steatohepatitis. It also 
brings some understanding of the metabolic effects of the omega-3 fatty 
acids, which are often proposed as food supplements. This may open 
novel avenues of research for drug development in the treatment of 
obesity, lipid metabolism abnormalities and liver diseases, because 
receptors that sense free fatty acids represent attractive drug targets. 


METHODS SUMMARY 


GPR120-deficient mice were generated by deleting Gpr120 exon 1. All animal 
procedures and euthanasia were reviewed by the local animal care committee 
approved by local government authorities. Blood analysis, extraction and detec- 
tion of mRNA and proteins, and immunohistochemical analysis, were performed 
following standard protocols as described previously. Details of antibodies, 
primers and probes are given in Methods. The level of significance for the differ- 
ence between data sets was assessed using Student’s t-test. Analysis of variance 
followed by Tukey’s test was used for multiple comparisons. 




©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


In human, GPR120 expression in liver and in both omental and subcutaneous 
adipose tissues was assessed by quantitative RT-PCR (TaqMan), in lean and obese 
subjects from the Atlas Biologique de l’Obésité Sévère cohort. The four GPR120 
exons were sequenced in 312 French, extremely obese subjects following a standard 
Sanger protocol. The two identified non-synonymous variants (p.R270H and 
p.R67C/rs6186610) were subsequently genotyped in a large European obesity 
case-control study (n_cases = 6,942, n_controls = 7,654), by high-resolution melting 
analysis and TaqMan, respectively. The association between obesity status and 
each variant was assessed using logistic regression adjusted first for age and gender 
and then for age, gender and geographical origin, under an additive model. The 
consequences of the two identified non-synonymous variants for GPR120 function 
([Ca2+]i response and GLP-1 secretion) were assessed in vitro. The human 
study protocol was approved by the local ethics committee, and participants from 
all of the studies signed an informed consent form. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Generation and genotyping of GPR120-deficient mice. GPR120-deficient mice 
on a mixed C57Bl/6/129 background were generated by homologous recombina- 
tion. Exon 1 of the Gpr120 gene was replaced with a PGK-neo cassette (Supplemen- 
tary Fig. 1). 

Animals. Mice were housed under a 12-hr light-dark cycle and given regular 
chow, MF (Oriental Yeast Co.). For HFD studies, 5-week-old male mice were 
placed on a 58Y1 diet (PMI Nutrition International) for a total period of 11 weeks. 
The methods used for animal care and experimental procedures were approved by 
the Animal Care Committee of Kyoto University. 

Indirect calorimetry. Twenty-four-hour energy expenditure and respiratory 
quotient (RQ) were measured by indirect calorimetry, using an open-circuit 
calorimeter system (MK-5000RQ, Muromachi Kikai Co.). Respiratory quotient 
is the ratio of carbon dioxide production to oxygen consumption (VO2). Energy 
expenditure was calculated as the product of the calorific value of oxygen 
(3.815 + 1.232 × RQ) and VO2. Locomotor activity was measured by using an 
infrared-ray passive sensor system (Supermex, Muromachi Kikai Co.). 
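
The energy expenditure calculation described above can be written out as a short sketch. The function names and the example gas-exchange values are hypothetical, and the unit convention (VO2 in litres per unit time giving kcal per the same unit time) is an assumption, since the text does not state units for the calorific value of oxygen.

```python
def respiratory_quotient(vco2: float, vo2: float) -> float:
    """RQ = carbon dioxide production / oxygen consumption (same units)."""
    if vo2 <= 0:
        raise ValueError("VO2 must be positive")
    return vco2 / vo2


def energy_expenditure(vco2: float, vo2: float) -> float:
    """Energy expenditure = (3.815 + 1.232 * RQ) * VO2, as given in the text.

    Units are an assumption: litres of O2 per unit time in, kcal per the
    same unit time out.
    """
    rq = respiratory_quotient(vco2, vo2)
    calorific_value = 3.815 + 1.232 * rq
    return calorific_value * vo2


# Hypothetical example: VCO2 = 0.8 and VO2 = 1.0 (arbitrary but equal units).
print(respiratory_quotient(0.8, 1.0))
print(energy_expenditure(0.8, 1.0))
```

Because RQ enters only through the calorific value, a shift in substrate use changes the estimate by at most a few per cent for a fixed VO2.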
Histology and immunohistochemistry. Epididymal adipose and pancreatic tissues 
were fixed in 10% neutral-buffered formalin, embedded in paraffin, and sectioned at 
5 µm. H&E staining was performed using standard techniques. To measure the 
diameter of adipocytes and the area of pancreatic islets, the diameters of 100 cells 
from five sections from each group were measured using NIH IMAGE software. More 
than 10 fields were examined, islet area was traced and total islet area was calculated 
and expressed as the average score. Liver tissues were embedded in OCT compound 
(Sakura Finetech) and snap-frozen in liquid nitrogen. Tissue sections were stained 
with Oil Red O (Sigma-Aldrich) for lipid deposition using standard methods. 
Triglyceride content assay. To determine the triglyceride content of liver, tissue 
was homogenized with 1/2.5/1.25 (vol/vol) 0.5 M acetic acid/methanol/ 
chloroform. The mixture was shaken and 1.25 volumes of chloroform added. 
The mixture was shaken overnight and then 1.25 volumes of 0.5 M acetic acid 
added. After centrifugation at 1,500g for 10 min, the organic layer was collected, 
dried and resuspended in 100% isopropyl alcohol. Measurements were conducted 
using Triglyceride E-test Wako (Wako). 

Glucose tolerance and insulin tolerance tests. Glucose tolerance assays were 
performed on 24-hr-fasted mice. After baseline glucose values were individually 
established using One Touch Ultra (LifeScan), each mouse was given an intra- 
peritoneal injection of 1.5 mg glucose per gram of body weight. Insulin tolerance 
was conducted using the same glucometer. After baseline glucose values were 
established, mice were given human insulin (0.75 mU g⁻¹, intraperitoneal; 
Sigma-Aldrich). Clearance of plasma glucose was subsequently monitored at 15, 
30, 60, 90 and 120 min post-injection. 
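
The weight-based dosing in the two tolerance tests is simple arithmetic, sketched here for convenience. The function names and the 30 g example mouse are hypothetical; the dose constants and sampling times are the ones stated in the text.

```python
GLUCOSE_MG_PER_G = 1.5    # intraperitoneal glucose, mg per gram of body weight
INSULIN_MU_PER_G = 0.75   # intraperitoneal insulin, mU per gram of body weight
SAMPLING_TIMES_MIN = (15, 30, 60, 90, 120)  # post-injection glucose readings


def glucose_dose_mg(body_weight_g: float) -> float:
    """Glucose dose in mg for the glucose tolerance test."""
    return GLUCOSE_MG_PER_G * body_weight_g


def insulin_dose_mu(body_weight_g: float) -> float:
    """Insulin dose in mU for the insulin tolerance test."""
    return INSULIN_MU_PER_G * body_weight_g


# Hypothetical 30 g mouse.
print(glucose_dose_mg(30.0), "mg glucose")
print(insulin_dose_mu(30.0), "mU insulin")
```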
Immunoblot analysis. For insulin stimulation, 5 U insulin (Sigma-Aldrich) was 
injected through the inferior vena cava. Five minutes later, samples of liver, skeletal 
muscle or WAT were dissected and immediately frozen in liquid nitrogen. Immunoblot 
analyses were performed as described previously. Anti-IRS1 (Millipore), 
anti-IRS2 (Millipore), anti-SCD1 (Santa Cruz Biotechnology), anti-IRβ, anti-AKT 
(Cell Signaling Technology), anti-p-AKT (Cell Signaling Technology) and anti-β-actin 
(Sigma-Aldrich) antibodies were used as the primary antibodies. 

Mouse gene expression analysis. Total RNA was extracted from tissue or cells 
using ISOGEN (Nippon Gene). Quantitative RT-PCR and microarray analysis 
were performed as described previously. Briefly, genome-wide mRNA expression 
profiles were obtained by microarray analysis with the Affymetrix GeneChip 
Mouse 430 2.0 Array, according to the manufacturer’s instructions. We used the 
robust multi-array analysis expression measure that represents the log-transformation 
of intensities (background corrected and normalized) from the GeneChips. 
Functional associations between differentially expressed genes were analysed using 
Ingenuity Pathways Analysis (version 4.0, Ingenuity Systems). 

Microcomputed tomography scanning. Images were obtained using a micro- 
computed tomography system (SHIMADZU ClairvivoCT) with a high-resolution 
flat-panel detector. The maximum resolution of this modality was less than 40 µm. 
The scanner was assumed to have a cylindrical field of view of 65.3 mm in section 
view and of 300 mm in transaxial view. The X-ray source was biased at 60 keV with 
the anode current set to 160 µA. Computed tomography images were analysed 
with OSIRIX software (http://www.osirix-viewer.com/). 

Fatty acid composition of epididymal WAT, liver and plasma. Esterified and 
non-esterified fatty acid composition was measured by gas chromatography. 
Briefly, to analyse esterified fatty acid, samples of epididymal adipose tissue 
(20-25 mg), liver (25-30 mg) and plasma (100 µl) were snap-frozen in liquid 
nitrogen and homogenized in 4 ml of 0.5 N KOH-methanol. Samples were then 
boiled at 100 °C for 30 min to hydrolyse. Total lipids in each sample homogenate 
were then extracted with hexane, followed by trans-esterification of fatty acids 
using boron trifluoride-methanol at 100 °C for 15 min. Methylated fatty acids were 
then extracted with hexane and analysed using a GC-2010AF gas chromatograph 
(SHIMADZU). For the analysis of non-esterified fatty acid, samples of epididymal 
adipose tissue (10-15 mg), liver (10-15 mg) and plasma (100 µl) were snap-frozen 
in liquid nitrogen and homogenized in a mixture of 1.2 ml water, 3 ml 
methanol and 1.5 ml chloroform. Total lipids in each sample homogenate were 
extracted with a mixture of 1.2 ml of water and 1.2 ml of chloroform, followed by 
silylation of fatty acids using N,O-bis(trimethylsilyl)trifluoroacetamide with 1% 
trimethylchlorosilane at 100 °C for 60 min. Silylated fatty acids were then extracted 
with hexane and analysed using a GC-2010AF gas chromatograph (SHIMADZU). 
Mouse embryonic fibroblast adipogenesis assay. To prepare MEFs, we minced 
13.5-d-post-coital mouse embryos and digested them with trypsin. Cells were 
collected and cultured in modified Eagle’s medium (α-MEM; supplemented with 
10% fetal bovine serum (FBS), 50 U ml⁻¹ penicillin and 50 mg ml⁻¹ streptomycin). 
We induced confluent MEFs to undergo adipogenic differentiation by incubating 
them first for 2 d with 10 µg ml⁻¹ insulin, 250 nM dexamethasone and 0.5 mM 
isobutylmethylxanthine (Sigma-Aldrich). We measured cellular triglyceride con- 
tent with Triglyceride E-test Wako (Wako). 

Lipid infusion. Intralipid solution with 2 mM triglycerides:palmitoleate was prepared 
using a previously described protocol. Briefly, lipids were dissolved in a 
solvent containing 5% glycerol and 0.72% phosphocholine in 0.9% saline and 
sonicated repeatedly. Lipids stayed in suspension for one week and had to be 
vortexed well before loading the syringe and tubing to prevent clogging. Before 
lipid infusion, mice were anaesthetized and an indwelling catheter was inserted in 
the left internal jugular vein. After overnight fasting, lipids were infused at a rate of 
500 ml kg⁻¹ h⁻¹ for 6 h. At the end of the infusion, tissues were collected. 
Statistical analysis of the GPR120-deficient mouse study. The level of signifi- 
cance for the difference between data sets was assessed using Student’s t-test. 
Analysis of variance followed by Tukey’s test was used for multiple comparisons. 
Data were expressed as mean ± s.e.m. P < 0.05 was considered to be statistically 
significant. 

Human study population. The study protocol was approved by all local ethics 
committees and informed consent was obtained from each subject before par- 
ticipation in the study, in accordance with the Declaration of Helsinki principles. 
For children younger than 18 years, an oral consent was obtained and parents 
provided written informed consent. All subjects were of European origin. 
Human gene expression analysis. We used liver, subcutaneous and omental 
adipose tissue samples from the Atlas Biologique de l’Obésité Sévère (ABOS) 
cohort (ClinicalTrials.gov NCT01129297), a cohort studied in the Département de 
Chirurgie Générale et Endocrinienne (Lille CHRU). Total RNA was extracted 
from the tissues using an RNeasy protect Mini Kit (QIAGEN) and quantified by 
absorbance at 260 nm and 280 nm in a PerkinElmer spectrophotometer. Human 
GPR120, FABP4, PPARG, CD68 and SCD mRNA levels were quantified by reverse 
transcription reaction followed by qRT-PCR. Quantitative assessment of human 
mRNA expression was performed using TaqMan Gene Expression Assays 
(Hs01111664_m1: GPR120 and Hs99999905_m1: GAPDH; Hs00609791_m1: 
FABP4; Hs00234592_m1: PPARG; Hs00154355_m1: CD68; Hs01682761_m1: 
SCD; Applied Biosystems) with an Applied Biosystems 7900HT Fast Real-Time 
PCR System. As an internal control for potential housekeeper reference variability, 
gene transcript levels were normalized to GAPDH reference housekeeper transcript 
level. The mean of the triplicate cycle thresholds of the target was normalized 
to the mean of triplicate cycle thresholds of the reference internal housekeeper genes 
using the formula $2^{CT_{ref} - CT_{target}}$, which yielded a relative target-to-reference 
transcript concentration value as a fraction of reference transcript. Samples for which 
the cycle threshold was above 35 were excluded from the analysis. 
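
The normalization described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' analysis code: the formula is read as 2 raised to (mean reference CT minus mean target CT), the function name and example CT values are hypothetical, and the cut-off of 35 is applied here to the mean CT of either gene, which the text does not specify.

```python
from statistics import mean

CT_CUTOFF = 35.0  # samples with a cycle threshold above this are excluded


def relative_expression(target_cts, reference_cts, cutoff=CT_CUTOFF):
    """Return target transcript level as a fraction of the reference transcript.

    target_cts / reference_cts: triplicate cycle thresholds for the target
    gene (e.g. GPR120) and the reference gene (GAPDH).
    Returns None when the sample is excluded by the CT cut-off (assumed to
    apply to the triplicate means).
    """
    ct_target = mean(target_cts)
    ct_ref = mean(reference_cts)
    if ct_target > cutoff or ct_ref > cutoff:
        return None
    # Each cycle of difference corresponds to a twofold difference in template.
    return 2.0 ** (ct_ref - ct_target)


# Hypothetical triplicates: target detected 5 cycles later than GAPDH.
print(relative_expression([25.0, 25.0, 25.0], [20.0, 20.0, 20.0]))
```

A target detected later than the reference gives a value below 1, consistent with the "fraction of reference transcript" wording.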

GPR120 exon sequencing. We sequenced the four GPR120 exons in 312 obese 
patients including 121 French obese adults and 191 French obese children who 
were recruited by the CNRS-UMR8199 unit and the Department of Nutrition of 
Paris Hotel Dieu Hospital. GPR120 is located on human chromosome 10q23.33 
and encodes a 377-amino-acid protein (NCBI NM_181745.3 and NP_859529). 
PCR conditions and primer sequences are available on request. Fragments were 
bidirectionally sequenced using the automated 3730xl DNA Analyzer (Applied 
Biosystems). Electropherogram reads were assembled and analysed using the 
VARIANT REPORTER software (Applied Biosystems). The locations of the var- 
iants are displayed by base numbers counting from the ATG translation initiation 
codon following the Human Genome Variation Society nomenclature for the 
description of sequence variations. The positions of mutations were indicated 
by reference to the human genome build NCBI36/hg18. 

Genotyping of the p.R270H and p.R67C/rs6186610 variants. We genotyped 
the two non-synonymous variants in 6,942 unrelated obese subjects and in 7,654 
control subjects, all of European descent. Genotyped populations are described in 
Supplementary Table 4. The set of obese subjects included 516 unrelated French 
obese children who were recruited by the CNRS-UMR8199 unit or Toulouse 
Children’s Hospital; 332 Italian obese children from Verona or Rome; 170 
Finnish obese adolescents from the Northern Finland Birth Cohort 1986 
(NFBC1986); 1,164 unrelated French obese adults from the ABOS cohort or 
recruited by the CNRS-UMR8199 unit and the Department of Nutrition of 
Paris Hotel Dieu Hospital; 2,514 Belgian obese patients from the outpatient 
obesity clinic at Antwerp University Hospital; 1,736 Swiss obese subjects who 
were recruited after gastric surgery in Zurich; and 510 Greek obese subjects 
recruited in the Hippokration Hospital of Thessaloniki or in the Second 
Department of Internal Medicine of the Hospital of Alexandroupolis. The set 
of control subjects included 422 Italian lean children from Verona; 4,639 Finnish 
lean adolescents from the NFBC1986 cohort; 1,976 French lean adults from the 
D.E.S.I.R. (Data from the Epidemiological Study on the Insulin Resistance 
syndrome) prospective study and from the Haguenau study; 148 Belgian lean 
subjects from Antwerp Hospital; and 469 Greek lean individuals recruited in 
medical examination centres in Thessaloniki. The 1,109 French pedigrees 
selected for obesity were recruited by the CNRS-UMR8199, and the 780 
German childhood obesity trios were recruited at the Universities of Marburg 
and Essen. The p.R270H variant was genotyped using the LightCycler 480 
High Resolution Melting (HRM) Master kit (Roche), following the manufacturer’s 
protocol. Genotyping of p.R67C/rs6186610 was performed using a custom TaqMan 
assay according to the manufacturer’s instructions (Applied Biosystems). Allelic 
discrimination was performed using an Applied Biosystems 7900HT Fast Real- 
Time PCR System and SDS 2.3 software. For both variants, genotype success rate 
was at least 95% and no deviation (P > 0.05) from Hardy-Weinberg equilibrium 
was observed in any of the examined populations. 
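
The text does not say which test was used for Hardy-Weinberg equilibrium. A common choice, assumed here purely for illustration, is the one-degree-of-freedom chi-square goodness-of-fit test on genotype counts. The function name and the genotype counts in the example are hypothetical.

```python
import math


def hwe_chi_square(n_major_hom: int, n_het: int, n_minor_hom: int):
    """Chi-square test (1 d.f.) for deviation from Hardy-Weinberg equilibrium.

    Returns (chi2, p_value) for the observed genotype counts.
    """
    n = n_major_hom + n_het + n_minor_hom
    if n == 0:
        raise ValueError("at least one genotype is required")
    p = (2 * n_major_hom + n_het) / (2 * n)  # major allele frequency
    q = 1.0 - p                              # minor allele frequency
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_major_hom, n_het, n_minor_hom)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of a chi-square variable with 1 d.f.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts for a low-frequency variant in 1,000 subjects.
chi2, p_value = hwe_chi_square(972, 27, 1)
print(f"chi2 = {chi2:.3f}, P = {p_value:.3f}")
```

For variants as rare as p.R270H the expected count of minor-allele homozygotes is small, so an exact test may be preferable to the chi-square approximation in practice.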

Phenotyping. The 90th and 97th body mass index (BMI) percentiles were used as 
thresholds for childhood overweight and obesity, respectively, according to the 
recommendations of the European Childhood Obesity Group study in local reference 
populations. Adult subjects were defined as normal (BMI < 25 kg m⁻²), overweight 
(25 ≤ BMI < 30 kg m⁻²) and obese (BMI ≥ 30 kg m⁻²) according to the 
International Obesity Task Force recommendations. 
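
The adult categories above translate directly into a small classifier. This is a sketch with hypothetical function names; it assumes the overweight band includes a BMI of exactly 25 and the obese band includes exactly 30, and it covers adults only, since the childhood thresholds rely on population-specific percentile tables that are not reproduced in the text.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index in kg per square metre."""
    if height_m <= 0:
        raise ValueError("height must be positive")
    return weight_kg / height_m ** 2


def classify_adult(bmi_value: float) -> str:
    """Adult category using the thresholds quoted in the text."""
    if bmi_value < 25.0:
        return "normal"
    if bmi_value < 30.0:
        return "overweight"
    return "obese"


# Hypothetical adult: 95 kg, 1.75 m.
value = bmi(95.0, 1.75)
print(round(value, 1), classify_adult(value))
```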

In silico analysis of both p.R270H and p.R67C variants. Phylogenetic conser- 
vation of the part of GPR120 containing each non-synonymous variant was 
assessed using the UCSC genome browser (Vertebrate Multiz Alignment & 
Conservation), based on a phylogenetic hidden Markov model, phastCons. To 
predict the possible effect of both amino-acid substitutions on the structure and 
function of GPR120, we used several programs: the POLYPHEN (polymorphism 
phenotyping) web-based program; PANTHER (protein analysis through 
evolutionary relationships); the SIFT (sorting intolerant from tolerant) algorithm; 
the SNAP (screening for non-acceptable polymorphisms) software; 
and the PMUT web-based program. 

Plasmid construction. A FLAG-human GPR120/pcDNA5/FRT/TO plasmid was 
constructed by ligating GPR120 complementary DNA into the multicloning site of 
the mammalian expression vector pcDNA5/FRT/TO (Invitrogen) with the amino-terminal 
FLAG tag. The point mutations for constructing the FLAG-human GPR120 
p.R67C/pcDNA5/FRT/TO and FLAG-GPR120 p.R270H/pcDNA5/FRT/TO 
plasmids were introduced using the following primers: p.R67C (sense: 
5’-ggtgctggtggcgtgccgacgacgcc-3’; antisense: 5’-ggcgtcgtcggcacgccaccagcacc-3’) and 
p.R270H (sense: 5’-agccaccagatccacgtgtcccagcaggac-3’; antisense: 
5’-gtcctgctgggacacgtggatctggtggct-3’). All constructs were confirmed by DNA sequencing. 
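
Mutagenesis primer pairs of this kind are normally exact reverse complements of each other, which gives a quick way to sanity-check the sequences listed above. The sketch below uses hypothetical helper names; the p.R67C sense primer is written on the assumption that it is the exact reverse complement of its antisense partner, which is how the extracted sequence is read here.

```python
# Primer pairs (sense, antisense), both written 5'->3'.
PRIMERS = {
    "p.R67C": ("ggtgctggtggcgtgccgacgacgcc",
               "ggcgtcgtcggcacgccaccagcacc"),
    "p.R270H": ("agccaccagatccacgtgtcccagcaggac",
                "gtcctgctgggacacgtggatctggtggct"),
}

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (lower-case a/c/g/t only)."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def is_complementary_pair(sense: str, antisense: str) -> bool:
    """True when the antisense primer is the reverse complement of the sense primer."""
    return reverse_complement(sense) == antisense.lower()


for name, (sense, antisense) in PRIMERS.items():
    print(name, len(sense), is_complementary_pair(sense, antisense))
```

The same helper can be used to confirm that the mutated codon sits near the middle of each primer, which is the usual design for site-directed mutagenesis.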

Cell lines and cell culture. Flp-In T-REx-293 (T-REx 293) cells (Invitrogen) were 
used to develop inducible and stable cell lines expressing GPR120 (wild type, 
p.R270H or p.R67C). T-REx 293 cells were routinely cultured in Dulbecco’s modified 
Eagle’s medium (DMEM; Sigma) supplemented with 10% FBS, 100 µg ml⁻¹ Zeocin 
(Invitrogen) and 10 µg ml⁻¹ blasticidin S (Funakoshi). T-REx 293 cells were transfected 
with FLAG-GPR120 (wild type, p.R270H or p.R67C)/pcDNA5/FRT/TO 
using Lipofectamine reagent (Invitrogen) and selected with DMEM, which had 
been supplemented with 10% FBS, 10 µg ml⁻¹ blasticidin S and 100 µg ml⁻¹ 
hygromycin B (Gibco BRL). GPR120 protein expression was induced by adding 
10 µg ml⁻¹ of doxycycline hyclate (Dox; Sigma) for 48 h. Human NCI-H716 cells 
were obtained from the American Type Culture Collection (Manassas). Cells were 
grown in suspension in Roswell Park Memorial Institute 1640 medium supplemented 
with 10% FBS, 100 IU ml⁻¹ penicillin and 100 µg ml⁻¹ streptomycin. 
[Ca2+]i response analysis. Cells were seeded at a density of 2 X 10° cells per well on 
collagen-coated 96-well plates, incubated at 37 °C for 21 h and then incubated in 
Hanks’ Balanced Salt Solution (HBSS, pH 7.4) containing Calcium Assay Kit 
Component A (Molecular Devices) for 1 h at 20 °C. ALA used in the fluorometric 
imaging plate reader (FLIPR, Molecular Devices) assay was dissolved in HBSS containing 
1% DMSO and prepared in another set of 96-well plates. These plates were set 
on the FLIPR, and mobilization of [Ca2+]i evoked by agonists was monitored. 
Transfection. One million cells were seeded into a 3.5-cm-diameter dish before 
transfection. NCI-H716 cells were transfected with 5 µg of each plasmid using 
Lipofectamine 2000 (Invitrogen) according to the manufacturer’s protocol. At 
24h post-transfection, transfection of each FLAG-tagged construct was confirmed 
by anti-FLAG FACS analysis. Then the cells were reseeded in 24-well culture plates 


coated with Matrigel matrix (BD Biosciences) at a density of approximately 
3 X 10° cells per well for the secretion studies. To test the effect of variant receptors 
on the ALA-induced [Ca2+]i response of the wild-type receptor, 2 X 10’ T-REx 
293 cells expressing Dox-inducible FLAG-GPR120 wild type were seeded into a 
15-cm-diameter dish before transfection. Cells were then transfected with 32 µg of 
each plasmid (empty vector, wild type and p.R270H GPR120) using 
Lipofectamine 2000 (Invitrogen) according to the manufacturer’s protocol. At 
24 h post-transfection, cells were reseeded at a density of 2X 10° cells per well 
on collagen-coated 96-well plates and treated with 10 µg ml⁻¹ of Dox, and at 48 h 
post-transfection, ALA-induced [Ca2+]i response was monitored. 

GLP-1 secretion analysis. Cells were serum-starved with FBS-free DMEM for 3 h, 
washed with HBSS and incubated for 2h at 37°C in 0.3 ml FBS-free DMEM 
containing DMSO (negative control), 1 µM phorbol 12-myristate 13-acetate
(positive control) or 100 µM ALA. Supernatants were collected and the active
GLP-1 concentration in the supernatant was determined by enzyme immunoassay 
using GLP-1 (Active) ELISA Kit (Millipore). 

Flow cytometry analysis. Anti-FLAG (Sigma) and anti-HA (Roche) antibodies 
were used for staining. Data were acquired and analysed on FACSCalibur with 
CELLQUEST software (Becton Dickinson). 

Statistical analysis of human study. We assessed the effect of both non- 
synonymous variants (p.R270H and p.R67C) on obesity using a logistic regression 
adjusted first for age and gender and then for age, gender and geography, under an 
additive model, using the software R (version 2.12). Adjustment for geography was
designed to reflect a north-south gradient between the six different countries of
origin of the study participants. An ordinal variable was created and coded: 1 for 
Finland, 2 for Belgium, 3 for France and Switzerland, 4 for Italy and 5 for Greece. 
This variable was added as a covariate in the logistic regression model. 
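
The covariate coding described above can be sketched in a few lines of Python. This is only an illustration of the ordinal geography variable and the additive genotype coding; the function names, the two-letter genotype strings and the column order are assumptions made for the example, and the analysis itself was run in R.

```python
# Ordinal north-south code for each country of origin, as listed in the text.
GEOGRAPHY_CODE = {
    "Finland": 1,
    "Belgium": 2,
    "France": 3,
    "Switzerland": 3,
    "Italy": 4,
    "Greece": 5,
}


def additive_dosage(genotype: str, variant_allele: str) -> int:
    """Count copies (0, 1 or 2) of the variant allele (additive-model coding).

    The genotype is represented here as a two-character string such as "GA";
    this representation is hypothetical.
    """
    return sum(1 for allele in genotype if allele == variant_allele)


def design_row(age, is_male, country, genotype, variant_allele):
    """Build one row of covariates: intercept, dosage, age, gender, geography."""
    return [
        1.0,
        additive_dosage(genotype, variant_allele),
        float(age),
        1.0 if is_male else 0.0,
        float(GEOGRAPHY_CODE[country]),
    ]
```

Rows built this way could be passed to any logistic regression routine, with obesity status as the binary outcome.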
Enzymatic catalysis of anti-Baldwin ring closure in polyether biosynthesis
Polycyclic polyether natural products have fascinated chemists and
biologists alike owing to their useful biological activity, highly 
complex structure and intriguing biosynthetic mechanisms. 
Following the original proposal for the polyepoxide origin of 
lasalocid and isolasalocid’ and the experimental determination 
of the origins of the oxygen and carbon atoms of both lasalocid 
and monensin, a unified stereochemical model for the biosynthesis 
of polyether ionophore antibiotics was proposed’. The model was 
based on a cascade of nucleophilic ring closures of postulated 
polyepoxide substrates generated by stereospecific oxidation of 
all-trans polyene polyketide intermediates’. Shortly thereafter, a 
related model was proposed for the biogenesis of marine ladder 
toxins, involving a series of nominally disfavoured anti-Baldwin, 
endo-tet epoxide-ring-opening reactions* °. Recently, we identified 
Lsd19 from the Streptomyces lasaliensis gene cluster as the epoxide
hydrolase responsible for the epoxide-opening cyclization of 
bisepoxyprelasalocid A° to form lasalocid A”*. Here we report the 
X-ray crystal structure of Lsd19 in complex with its substrate and 
product analogue’ to provide the first atomic structure—to our 
knowledge—of a natural enzyme capable of catalysing the dis- 
favoured epoxide-opening cyclic ether formation. On the basis of 
our structural and computational studies, we propose a general 
mechanism for the enzymatic catalysis of polyether natural product 
biosynthesis. 

Epoxide-opening cascades, including those using the disfavoured 
anti-Baldwin cyclization (Fig. 1a), have emerged as an important 
synthetic strategy in the field of organic chemistry’®. Whereas little is 
known about ladder polyether biosynthesis, biosynthesis of ionophore 
polyethers (Fig. 1b) has been studied extensively". Although most 
cyclic ethers in ionophore polyethers are the favoured exo-cyclization 
products (Fig. 1b, in blue), several compounds, including lasalocid A, 
contain six-membered rings presumably formed via a disfavoured 
endo-tet cyclization (Fig. 1b, in red). 

Lsd19 was co-crystallized with the substrate analogue at pH 4.6, and 
the crystal structure was determined to 1.59 Å resolution (Supplemen-
tary Table 1). Lsd19 consists of two highly homologous domains
(Fig. 2a) that belong to the nuclear transport factor 2 (NTF2)-like 
superfamily’*, which includes Δ⁵-3-ketosteroid isomerases (KSIs)!*
and limonene-1,2-epoxide hydrolase (LEH)"*. Each domain consists 
of three α-helices and a cone-like six-stranded β-sheet whose cavity
forms a deep substrate-binding pocket. The amino- and carboxy- 
terminal domains (Lsd19A and Lsd19B, respectively) are linked by a 
short loop and are oriented in a head-to-tail fashion, resembling the 
homodimer structure of single-domain NTF2-like proteins’’. Lsd19A 
and Lsd19B have nearly identical backbone conformations; their respective
117 core Cα atoms have a root-mean-squared deviation of just 0.79 Å.
There are two significant differences between Lsd19A and Lsd19B.
First, Lsd19B has an additional 18-residue-long loop-helix-loop near
the entrance to its substrate-binding pocket (Fig. 2a, in green). Second,
the active site of Lsd19A contains an uncyclized bisepoxide substrate 
(Fig. 2b), whereas the active site of Lsd19B contains a doubly cyclized 
tetrahydrofuran-tetrahydrofuran (THF-THF) product. This differ- 
ence in active sites is consistent with our previously reported results 
that indicated that Lsd19A catalyses a single round of cyclization of 
the bisepoxide substrate, whereas Lsd19B cyclizes the initially 
formed THF ether/monoepoxide intermediate, predominantly to a 
tetrahydrofuran-tetrahydropyran (THF-THP) product'’. 
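
The root-mean-squared deviation quoted for the two domains is a simple function of paired atomic coordinates. The sketch below shows the calculation for two coordinate lists that are assumed to be already superposed and matched atom by atom; the function name and the input format are illustrative and do not describe the software used for the structure.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-squared deviation between two paired lists of (x, y, z) points.

    Assumes the two sets are already optimally superposed and listed in the
    same atom order (for example, equivalent C-alpha atoms of two domains).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

Coordinates in ångströms give an RMSD in ångströms.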

In Lsd19A, the uncyclized substrate analogue is seen in the active 
site. This observation is consistent with the fact that Lsd19 is inhibited 
by the low pH of the crystallization buffer (Supplementary Fig. 1a). 
This characteristic is shared by many EHs'*"*. The electron density for 
Figure 1 | Polyether natural products and proposed steps in the cyclic ether
formation. a, An example of a cascade epoxide ring closure implicated in the 
biosynthesis of ladder polyether. b, Structures of representative ionophore 
polyethers. c, Plausible mechanism of epoxide-mediated cyclic ether formation 
leading to the formation of lasalocid A, isolasalocid A and the product 
analogues. 
Figure 2 | Crystal structure of Lsd19. a, The overall fold of Lsd19. The extra
loop-helix-loop in Lsd19B and the corresponding insertion site in Lsd19A are
shown in green. b, c, Lsd19A (b) and Lsd19B (c) electron density for the ligand
(simulated-annealing omit map contoured at 2.0σ, green mesh) and protein
side chains (2Fo − Fc map contoured at 1.5σ, blue mesh). Carbon atoms of the
bound ligands and protein are shown in yellow and orange, respectively.
Oxygen, nitrogen and sulphur atoms are shown in red, blue and green,
respectively. Green broken lines represent hydrogen bonds. d, e, Lsd19A
(d) and Lsd19B (e) active-site hydrogen-bonding interactions between the
ligand and protein side chains. The α1 helix (as labelled in panel a) was removed
to clarify the view. f, g, Lsd19A (f) and Lsd19B (g) molecular surface cross-
section, showing residues that determine the pocket size and shape.


the oxazolidinone portion of the substrate analogue is ill-defined and 
therefore omitted from the final model (Fig. 2b). As the oxazolidinone 
is not present in the natural substrate, disorder in this region is not 
unexpected. In contrast, the portion of the substrate that contains the 
two epoxide groups that eventually undergo cyclization is clearly 
defined in the electron density map (Fig. 2b). The terminal 22,23- 
epoxide sits at the bottom of the pocket and is surrounded by hydro- 
phobic residues, protected from potential non-specific nucleophilic 
attack. The putative nucleophilic 15-hydroxyl oxygen is hydrogen 
bonded to Asp 38, whereas the oxygen atom of the electrophilic 
18,19-epoxide that undergoes ring opening lies within hydrogen-
bonding distance to Tyr 14 and Glu 65. Arg 54 is hydrogen-bonded
to the catalytic Asp 38 via a water molecule, presumably promoting 
deprotonation and favourable positioning of the side-chain carboxylic 
acid of Asp 38 for ensuing proton abstraction from the 15-hydroxyl 
group. This active-site configuration is homologous to that of KSI’ 
and LEH", which perform similar acid/base catalysis (Fig. 3b). Unlike 
KSI or LEH, however, Lsd19 must perform conformationally sensitive 
intramolecular cyclizations on a flexible substrate. Additional inter- 
actions between Lsd19A and the 3-methylsalicylate side of the native 
substrate can help fix the ligand conformation for the ensuing cycliza- 
tion. Here, a water molecule hydrogen bonded by Thr 100 and Trp 128 
forms a hydrogen bond with the carbonyl oxygen of the bound sub- 
strate analogue (Fig. 2d). An equivalent oxygen atom is present in the 
natural substrate, bisepoxyprelasalocid A. 

In Lsd19B, an isolasalocid-like THF-THF compound is seen in the 
active-site pocket instead of the expected lasalocid-like THF-THP 
reaction product. This unexpected reaction compound is presumably 
a consequence of the acidic crystallization condition (Supplementary 
Fig. 1b). We have identified His 146, Asp 170, Glu 197 and Tyr 251 as 
primary candidates for the catalytic residues in Lsd19B (Fig. 2e). Like 
in Lsd19A, a hydrogen bond between Asp 170 and His 186 could raise 
the pKa of His 186. The 13-carbonyl oxygen is hydrogen bonded to
Arg 177, providing an additional enzyme-substrate interaction. The 
binding pocket of Lsd19B is shallower than that of Lsd19A due to the 
presence of Met 253 at the bottom of the pocket (Fig. 2g). This allows 
only the singly cyclized monoepoxide to bind in a catalytically relevant 
orientation, and promotes the critical substrate selectivity in these 
highly structurally similar domains. Although the shallower binding 
pocket in Lsd19B affords less substrate-binding energy available for 
confining the flexible substrate into the catalytically relevant con- 
formation’, the additional loop-helix-loop structure (Fig. 2a, e, in 
green) seems to compensate for the lost substrate-binding surface. 

Analysis of known polyether biosynthetic gene clusters reveals that 
two copies of polyether epoxide hydrolase (PEH)-encoding genes are 
present within each cluster: monBI, monBII (monensin)'* and nigBI, 
nigBII (nigericin)’’. For lsd19 (lasalocid)*” and nanI (nanchangmycin)”°,
two copies are encoded as a single gene. The tetronomycin gene cluster”!
carries a single copy of PEH gene tmnB, presumably because tetrono- 
mycin contains only one epoxide-derived THF ring. Sequence align- 
ment of PEHs indicates that the catalytic residues identified in Lsd19 
are highly conserved (Supplementary Fig. 2). This alignment also 
reveals that PEHs can be divided into two groups: PEH-A (Lsd19A- 
like) and PEH-B (Lsd19B-like). Only PEH-B has a bulky residue at 
position 104 that makes the binding pocket shallower, and only PEH-B
contains a loop-helix-loop insertion that provides additional ligand- 
binding surface. 

To extend the structural knowledge gained from Lsd19, homology 
models of other PEHs were constructed. In these models, a direct 
correlation is observed between the depth of the binding pocket and 
the length of the substrate chain (Supplementary Fig. 3), leading us to 
propose a general mode of cyclic ether formation in polyether bio- 
synthesis. With the deeper binding pockets, PEH-As are probably 
responsible for the formation of internal cyclic ethers. Interestingly, 
PEH-As that act on longer substrates, such as nigericin, carry extra 
PEH-A-specific C-terminal residues (Supplementary Fig. 2) positioned 
at the opening of the binding pocket (Supplementary Fig. 3a), possibly 
providing additional binding surface for the longer substrates. In con- 
trast, PEH-Bs with shallower binding pockets probably catalyse the 
formation of terminal cyclic ethers. As seen in Lsd19B, the extra PEH-
B-specific loop-helix-loop domain can provide the binding surface for
the portion of the substrate protruding out from the shallow substrate- 
binding pocket. 

As the current Lsd19B structure does not address the formation of 
THP directly, we used computational studies to understand how this 
enzyme catalyses the disfavoured 6-endo cyclization (Methods and Sup- 
plementary Tables 2-4). Quantum mechanical density functional 
calculations were performed to locate each transition structure for both 
5-exo and 6-endo epoxide openings**”* under acid- and base-catalysed 
conditions using model substrates, and to calculate the energy differ- 
ence between the two processes (Fig. 3a and Supplementary Table 2). 
As expected”’, 5-exo-tet cyclization is favoured with both acid and base 
catalysis. However, the preference for the five-membered ring is reduced 
under base-catalysed conditions, due to the increased steric hindrance of 
nucleophilic attack at the more substituted epoxide terminus, as well as 
Figure 3 | Computational studies of the Lsd19-catalysed epoxide-opening
cyclization reactions. a, Acid- versus base-catalysed cyclization of a model
system via 5-exo or 6-endo cyclization, and the respective calculated lowest-
energy competing transition structures. B2PLYP/6-311++G(d,p)//B2PLYP/6-31G(d)
activation free energies (kcal mol⁻¹) and forming/breaking bond
distances (in ångströms, shown in blue) are given. b, Comparison of active-site
the more product-like transition structure with shorter forming O-C
bond lengths. The six-membered product is indeed thermodynamically 
favoured. This provides an indication of how Lsd19B may achieve 
the six-membered ring formation. We then explored how Lsd19B 
would influence the regioselectivity of the ring-opening transition 
state. Docking studies on the Lsd19B structure indicated that 
Asp 170, Glu 197 and Tyr 251 maintain close contact with the bound 
ligand. Similarities between the active site organization of Lsd19B and 
KSI’ (Fig. 3b) further support the idea that Asp 170 acts as a general 
base, whereas Glu 197 and Tyr 251 act as general acids stabilizing
the developing transition structure oxyanion. This arrangement is 
also remarkably similar to the ‘theozyme’ derived from computations 
for the anti-Baldwin-cyclization antibody”, which was eventually 
found to be closely related to the antibody structure”. To test this idea 
quantitatively, competing 5-exo and 6-endo transition structures were 
optimized in the presence of Asp 170 and Tyr 251, whose geometry was 
constrained to the crystal structure coordinates. From this model, 
values of the activation energy and Gibbs free energy changes for ring 
closure were computed. These predict a preference for the 6-endo 
pathway of 2.5 kcal mol⁻¹, enough to achieve nearly 100:1 selectivity
(Fig. 3c). The positioning of the catalytic general acid and general base 
groups causes the 6-endo transition state to be favoured, leading to the 
thermodynamically more stable THP product.
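
The relation between an energy gap of this size and a product ratio can be checked with a Boltzmann estimate. The sketch below assumes kinetic control, equal pre-exponential factors for the two pathways and a temperature of 298.15 K; the function name is illustrative and is not taken from the computational protocol.

```python
import math

# Gas constant in kcal mol^-1 K^-1.
R_KCAL = 1.987204e-3


def selectivity_ratio(energy_gap_kcal_per_mol, temperature_k=298.15):
    """Ratio of rates for two competing pathways separated by a barrier gap.

    Assumes both pathways share the same pre-exponential factor, so the
    ratio reduces to exp(gap / RT).
    """
    return math.exp(energy_gap_kcal_per_mol / (R_KCAL * temperature_k))


if __name__ == "__main__":
    print(round(selectivity_ratio(2.5)))
```

Under these assumptions a 2.5 kcal mol⁻¹ preference corresponds to a ratio of roughly 68:1 at room temperature, the same order of magnitude as the selectivity quoted in the text.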

The structural knowledge gained here has provided a generalized view 
of the biosynthetic pathway involved in producing polyether natural 
products using only a pair of epoxide hydrolases. Furthermore, compu- 
tational studies have provided an explanation for the regiochemical pref- 
erence for the Lsd19-catalysed cyclization. Active-site pre-organization 
and general-base catalysis provide enzymatic control to overcome
otherwise disfavoured chemical transformations. Similarly, stereo- 
chemically complex polyether ladder structures can be generated from 
geometries in Lsd19B and KSI (PDB accession 1E3V™). c, ‘Theozyme’
calculations, optimizing the competing 5-exo-tet and 6-endo-tet transition
structures surrounded by fixed catalytic residues. Relative energies obtained by
B2PLYP/6-311++G(d,p)//M06-2X/6-31G(d) (kcal mol⁻¹) and forming/
breaking bond distances are given.
a simple polyepoxide substrate by Lsd19B-like EHs that can form
templating THP unit(s) that facilitate the subsequent cascade of 
step-wise endo-tet-selective epoxide-opening cyclizations. 


METHODS SUMMARY 


Lsd19 was produced using a variation of the published procedure®. Lsd19 was 
purified using metal affinity, anion exchange and size-exclusion chromatography 
steps to >95% purity and yielding approximately 2 mg protein per litre of culture. 
Initial crystals of Lsd19 with the presence of the substrate analogue® were obtained 
from the Hampton Crystal Screen. Subsequent optimization resulted in a crystal 
that diffracted to 1.59 Å resolution at the Stanford Synchrotron Radiation
Lightsource. Data were indexed and integrated using XDS*”. The structure of 
Lsd19 was determined by molecular replacement and refined by CNS” and 
REFMAC5 (CCP4)”’. Computational studies on the reaction mechanism were 
performed with B2PLYP/6-311++G(d,p) calculated energies on B2PLYP/6- 
31G(d) or M06-2X/6-31G(d) optimized geometries using the Gaussian 09 pro-
gram”. Docking studies were performed with Autodock Vina”.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Lsd19 expression and purification. Production of Lsd19 was performed partly
according to the procedure described elsewhere®. Briefly, pCold-based pKW620
carrying the lsd19 gene and the chaperone-encoding vector pG-KJE8 (Takara Bio)
were introduced into Escherichia coli BL21(DE3) strain. The culture was grown in
Luria Broth medium to an OD600 nm of 0.6, and the production of N-terminal His-
tagged Lsd19 was induced by 100 µM isopropyl-β-D-galactoside, 0.5 mg ml⁻¹
L-arabinose and 5 ng ml⁻¹ tetracycline. The culture was incubated for another
20 h at 15 °C. Cells were harvested by centrifugation, resuspended in 50 mM
sodium phosphate pH 7.4, 300 mM sodium chloride, 10 mM imidazole, 10%
(v/v) glycerol and lysed by sonication. After centrifugation at 15,000g for
45 min, the cleared supernatant was mixed with cobalt-agarose beads (Thermo
Fisher Scientific). The mixture was incubated for 1 h at 4 °C with moderate
shaking. After incubation, the mixture was loaded onto a column and washed
with a wash buffer containing 50 mM sodium phosphate pH 7.4, 300 mM sodium
chloride, 10 mM imidazole and 10% (v/v) glycerol. Lsd19 was eluted with a wash
buffer supplemented with 150 mM imidazole. Fractions containing Lsd19 were
pooled and exchanged into a buffer containing 20 mM Tris pH 8.5, 1 mM EDTA
and 15 mM β-mercaptoethanol. Lsd19 was further purified by anion exchange
chromatography using a HiTrapQ XL column followed by gel filtration on a
Superdex200 10/300 GL column (GE Healthcare Life Sciences) in 50 mM
potassium phosphate pH 7.0, 1 mM EDTA and 15 mM β-mercaptoethanol. The
final protein was >95% pure as judged by polyacrylamide gel electrophoresis, and
the yield was approximately 2 mg per litre of culture. The sample was further
concentrated to 12 mg ml⁻¹ for storage.

Crystallization, data collection and structure determination. Crystals were
obtained by the hanging-drop vapour diffusion method. The bisepoxide substrate
analogue was prepared as described’. After combining Lsd19 (6 mg ml⁻¹) with the
substrate analogue (100 mM in methanol) at a 12.5:1 ratio (v/v) on ice, the sample
was mixed with an equal volume of 0.1 M sodium acetate pH 4.6 and 1.6 M sodium
formate and equilibrated at 18 °C. Crystals were further optimized using the
additive 1,5-diaminopentane dihydrochloride in the presence of 200-700 µl Al’s oil in
the well. Before flash freezing, Lsd19 crystals were transferred for about 1 min to a
cryoprotectant solution comprising the crystallization buffer supplemented with
35% glycerol. X-ray diffraction data were collected from a single crystal to 1.59 Å
resolution at beam line 7-1 of the Stanford Synchrotron Radiation Lightsource (SSRL;
SLAC National Accelerator Laboratory). Data collection was performed
at 100 K using monochromatic X-rays at a wavelength of
0.9795 Å. Data were indexed and integrated using XDS**. Structure factors were
obtained with CTRUNCATE™. The structure of Lsd19 was determined by molecular
replacement with the program MOLREP” using the Pseudomonas putida biotype
B KSI structure (PDB accession 1DMN)”’ as a search model. The model was built
with Coot? and ARP/wARP” and refined with CNS*® and REFMAC5 (CCP4)”.
A Ramachandran plot of the refined model indicated that all of the residues adopted
allowed backbone conformations, with 98.2% in the favoured region.
Other relevant data and refinement statistics are given in Supplementary Table 1.
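
The starting composition of the crystallization drop follows from the mixing ratios given above. The sketch below works through that dilution arithmetic; the function and parameter names are illustrative, and the result describes only the drop at the moment of mixing, before vapour-diffusion equilibration changes the concentrations.

```python
def initial_drop_concentrations(protein_mg_ml=6.0, ligand_mM=100.0,
                                protein_to_ligand_ratio=12.5):
    """Return (protein in mg/ml, ligand in mM) in the freshly mixed drop.

    Step 1: protein and ligand stock are combined at the stated v/v ratio.
    Step 2: the mixture is combined with an equal volume of precipitant,
            which halves both concentrations.
    """
    total_parts = protein_to_ligand_ratio + 1.0
    protein_mixed = protein_mg_ml * protein_to_ligand_ratio / total_parts
    ligand_mixed = ligand_mM / total_parts
    return protein_mixed / 2.0, ligand_mixed / 2.0


if __name__ == "__main__":
    protein, ligand = initial_drop_concentrations()
    print(f"protein {protein:.2f} mg/ml, ligand {ligand:.2f} mM")
```

With the stated stocks this gives a drop starting at about 2.8 mg ml⁻¹ protein and 3.7 mM substrate analogue.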
Computation. Quantum chemical calculations were performed using Gaussian 
09, revision A.2**. All geometries were fully optimized. Transition structures were 
identified by a single imaginary harmonic vibrational frequency. Optimizations 
were performed with default convergence criteria at the B2PLYP/6-31G(d) level of 
theory using a fine grid for numerical integration. Free energies were computed in 
the harmonic approximation at 298 K and 1 atm, with unscaled zero-point vibra-
tional energies. Docking studies were performed with Autodock Vina”.

Single point energy calculations were performed to ensure basis set convergence 
at the B2PLYP/6-311++G(d,p)//B2PLYP/6-31G(d) level. Calculations were per-
formed at the ‘double-hybrid’ density functional level of theory with the B2PLYP
functional”. This method replaces a fraction of the semi-local correlation energy by 
a non-local correlation energy expression that uses the Kohn-Sham orbitals in 
second-order perturbation theory and delivers improved energetics over hybrid 
density functionals. Results obtained at this level were also compared with the hybrid 
meta-generalized gradient approximation M06-2X density functional. B2PLYP and 
M06-2X calculations are in accord, both qualitatively and quantitatively, over the
preference for 5-exo-tet over 6-endo-tet in the acid- versus base-catalysed cyclization 
of the model systems. Absolute barriers showed some variations; however, the 
energy differences between the two cyclization modes were consistent. Changing 
between double and triple zeta-valence polarized basis sets showed only small effects 
(Supplementary Table 2). 

The effects of solvation were treated with single point energy calculations on the 
gas-phase geometries using a conductor-like polarizable continuum (CPCM) 
model” of water (ε = 78.36) and dichloromethane (ε = 8.93), defining the solute
cavity by UFF radii. As for gas-phase values, B2PLYP and M06-2X results are in 
accord over preferred reaction pathway (Supplementary Table 3). 

‘Theozyme’ calculations were performed using a truncated model of the active 
site, constraining the positions of the non-hydrogen atoms in the catalytic residues 
Glu 197 and Tyr 251 in their crystal structure coordinates. All other atom positions 
are optimized. Electrostatic effects of the protein interior were described with a 
CPCM model of low dielectric constant, using diethyl ether as solvent (ε = 4.24)°*.
A variety of methods show consistent energy differences between 5-exo and 
6-endo pathways (Supplementary Table 4). Absolute energies and Cartesian coor- 
dinates for all species in the text are given in the Supplementary Data. Absolute 
gas-phase energies and free energies (1 atm, 298 K) from B2PLYP/6-31G(d)
optimization, B2PLYP/6-311++G(d,p) single point energies, single point energies
including CPCM solvation (UAKS solute radii) are all given in kcal mol⁻¹.
Imaginary harmonic vibrations are in cm⁻¹.
An oxygenase that forms and deoxygenates toxic epoxide


Robin Teufel, Thorsten Friedrich & Georg Fuchs


Catabolism may give rise to toxic intermediates that compromise 
cell vitality, such as epoxide formation in the recently elucidated 
and apparently universal bacterial coenzyme A (CoA)-dependent 
degradation of phenylacetic acid’. This compound is central to the 
catabolism of a variety of aromatics, such as phenylalanine, lignin- 
related compounds or environmental contaminants”’. The key 
phenylacetyl-CoA monooxygenase (epoxidase) of the pathway, 
PaaABCE'™”, is also connected to the production of various primary 
and secondary metabolites*’, as well as to the virulence of certain 
pathogens’’°"'. However, the enzyme complex has so far not been 
investigated in detail. Here we characterize the bacterial multi- 
component monooxygenase PaaABCE that, surprisingly, not only 
transforms phenylacetyl-CoA into its ring-1,2-epoxide, but also 
mediates the NADPH-dependent removal of the epoxide oxygen, 
regenerating phenylacetyl-CoA with formation of water. We pro- 
vide evidence for a catalytic di-iron centre that is probably the key to 
the unprecedented deoxygenation of an organic compound by an 
oxygenase. Presumably, the bifunctionality is vital to avoid toxic 
intracellular epoxide levels if the subsequent catabolic steps are 
impeded. Our data suggest that detoxification is assisted by two 
thioesterases (PaaI and PaaY) forming non-reactive breakdown
products. Hence, PaaABCE may harbour an intrinsic escape mech- 
anism from its own toxic product and represents the archetype of a 
bifunctional oxygenase/deoxygenase. Analogous reactions may 
possibly be catalysed by other di-iron epoxidases. 

PaaABCE catalyses a crucial step in bacterial degradation of phe- 
nylacetic acid (paa), namely, the epoxidation of phenylacetyl (pa)-CoA 
(Fig. 1, Supplementary Fig. 1). Its encoding genes are present in ~16% 
of completely sequenced bacterial genomes’. Among Archaea, it was
mainly Halobacteria that acquired bacterial PaaABCE (Supplementary 
Fig. 2), as this group seemed inclined to take up bacterial genes'”. This 
enzyme is probably the most widespread member of bacterial multi- 
component monooxygenases that generally consist of oxygenase sub- 
units, a small effector protein, and a reductase component”. According 
to sequence comparison, subunit A (PaaA) is the catalytic oxygenase 
component harbouring conserved EX₂H motifs for iron binding and is
a member of the class I di-iron proteins'*” that are known to catalyse 
diverse and functionally unrelated enzymatic reactions, such as methane 
monooxygenation or ribonucleotide reduction’. PaaE resembles 
reductases with [2Fe-2S] and flavin cofactor binding domains; PaaE 
probably transfers electrons from NADPH to the di-iron centre of 
subunit A’*°. Together, subunits PaaAE resemble benzoyl-CoA 
epoxidase BoxAB, which similarly transforms benzoyl-CoA into its 
ring-2,3-epoxide’*”*. 

We determined cofactors, metal content, and acid-labile sulphur 
content of purified recombinant PaaABCE from Pseudomonas sp. 
strain Y2, and we also recorded ultraviolet-visible and electron para- 
magnetic resonance (EPR) spectra (Supplementary Figs 3-5). The 
composition of the complex was PaaAzB3_4C,E, or multiples of this. 
Subunits B and C appear not to contain metals or cofactors, whereas a 
possible subunit PaaD did not copurify and had no effect on in vitro 
activity. It may be involved in maturation of PaaA or PaaE rather than 
catalysis'* (see discussion in Supplementary Information). We found 
6.3 ± 0.1 mol Fe per mol of PaaA;E;, in good accordance with the
presence of one [2Fe-2S] cluster per PaaE and one di-iron centre per 
PaaA, as further corroborated by the EPR spectra (Supplementary Fig. 6). 
Figure 1 | Overview of reactions and proposed salvage reactions of
phenylacetate degradation. Black arrows show catabolic steps leading to
acetyl-CoA and succinyl-CoA. When oxidation of the labile aldehyde is
impaired (A), the derivative 2-hydroxycyclohepta-1,4,6-triene-1-carboxyl-
CoA is spontaneously formed (B) and inhibits hydrolytic ring-cleavage (C)
leading to ring-1,2-epoxyphenylacetyl-CoA (ep-CoA) accumulation (orange
arrows). PaaABCE then prevents overflow of toxic epoxides through
deoxygenation (D), assisted by pa-CoA thioesterase PaaI (E), whereas
2-hydroxycyclohepta-1,4,6-triene-1-carboxyl-CoA thioesterase PaaY
diminishes inhibition (F) (green arrows). Some organisms use the (perhaps
CoA-freed) derivatives as starting material for antibiotics and fatty acids
(dashed arrows). For other enzymes and intermediates, see Supplementary Fig. 1.
One mol PaaE contained 1.9 ± 0.2 mol acid-labile sulphur and 0.73 mol
FAD, confirming the presence of one FAD and one [2Fe-2S] binding 
domain per subunit E. Ultraviolet—visible spectra were consistent with 
the presence of these cofactors (Supplementary Fig. 4). 

To test PaaABCE activity, NAD(P)H and oxygen consumption were 
monitored in photometric tests and with an oxygen optode (Methods), 
respectively. Epoxidase activity amounted to ~1.0 U mg⁻¹ with
NADPH and ~0.05 U mg⁻¹ with NADH (at 30 °C and optimal pH
8). As determined by high performance liquid chromatography 
(HPLC), pa-CoA was consumed ~1.5 times faster when ring-1,2- 
epoxyphenylacetyl-CoA (ep-CoA) was removed through subsequent 
enzymes PaaG and PaaZ (Fig. 1). PaaABCE exhibited high affinities for 
NADPH (apparent Michaelis constant Km = 23 ± 3 µM), pa-CoA
(Km = 6 ± 1 µM) and O₂ (Km = 3 ± 1 µM). In the absence of pa-
CoA, PaaABCE uncoupled and slowly reduced O₂ to water rather than
H₂O₂, with an NADPH oxidation rate of 0.07 ± 0.02 U mg⁻¹ (black
dashed line, Fig. 2a). Addition of pa-CoA then switched the enzyme 
from oxidase to oxygenase (Fig. 2a, Supplementary Table 1). The non- 
physiological substrate benzoyl-CoA was not transformed to detectable 
products, but apparently first uncoupled and then permanently in- 
activated PaaABCE (Supplementary Fig. 6). An overview of the main 
features of PaaABCE is given in Supplementary Table 1. 
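
The photometric readout used in these activity tests converts to NADPH concentration through the Beer-Lambert law. The sketch below uses the extinction coefficient at 365 nm (3,500 M⁻¹ cm⁻¹) and the 0.5-cm path length stated in the Figure 2 legend; the function name is illustrative.

```python
def nadph_consumed_mM(delta_a365, epsilon_M_cm=3500.0, path_cm=0.5):
    """Convert an absorbance decrease at 365 nm into consumed NADPH (mM).

    Beer-Lambert law: delta_A = epsilon * concentration * path length,
    so concentration = delta_A / (epsilon * path length).
    """
    concentration_molar = delta_a365 / (epsilon_M_cm * path_cm)
    return concentration_molar * 1000.0  # mol/l -> mmol/l


if __name__ == "__main__":
    print(nadph_consumed_mM(0.175))
```

An absorbance change of 0.175 therefore corresponds to about 0.1 mM NADPH, in agreement with the figure legend.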

Surprisingly, when pa-CoA was virtually completely converted to 
ep-CoA, the NADPH consumption remained significantly higher 
(dashed purple line, Fig. 2a) than in the initially observed uncoupled 
state (that is, before pa-CoA addition). When oxygen concentration 
was monitored, the rate of O2 consumption was half that of NADPH 
consumption (Supplementary Table 2, Supplementary Fig. 7). Hence, 
the produced ep-CoA probably induced NADPH and O₂ consump-
tion. To better understand this behaviour, we took sequential samples
from a photometric test and immediately quenched the enzymatic 
reaction, followed by HPLC and mass spectrometry product analysis. 
We observed nearly complete back conversion of ep-CoA to pa-CoA 
when O₂ became limiting. Moreover, pa-CoA was again transformed to
ep-CoA after adding O₂ (samples 5 and 6, Fig. 2), indicating that
PaaABCE also functions as a deoxygenase (epoxide reductase), whereas 
spontaneous ep-CoA decay produced 2-hydroxyphenylacetate and 
CoA (Fig. 2b and Supplementary Fig. 8). Importantly, PaaABCE strictly 
required NADPH to achieve deoxygenation of ep-CoA (Supplementary 
Fig. 9). The continuing NADPH consumption in the photometric test 
thus implied an equilibrium between ep-CoA deoxygenation and pa- 
CoA reoxygenation that was determined by the absence or presence of 
oxygen. To further test the deoxygenation hypothesis, we added ¹⁸O-
labelled ep-CoA to PaaABCE in the presence of NADPH and air-
saturated water. We observed the exchange of ¹⁸O for ¹⁶O in the epoxide
over time (Supplementary Fig. 10). On the basis of our experiments and 
of established knowledge from other extensively studied di-iron 
enzymes (benzoyl-CoA epoxidase’®, soluble methane monooxygenase 
(sMMO)"’), we tentatively propose the following reactions (for steps 
and intermediates see Fig. 3). 

First, we consider PaaABCE in the presence of O2, NADPH and 
pa-CoA (continuous red line, Fig. 2a). On binding of the substrate, the 
di-iron centre ground state Fe(III)(µ-O)Fe(III) of PaaA (compound 1, equivalent
to compound T of sMMO) is NADPH-dependently reduced to
Fe(II)Fe(II) (compound 2, equivalent to compound Hred of sMMO) with
release of water (step I). This state then directly reacts with O2 and
generates a peroxy Fe(III)-O-O-Fe(III) species, which rearranges to the
key oxygenating intermediate Fe(IV)(µ-O)2Fe(IV) ('diamond core') or alternatively
O=Fe(IV)(µ-O)Fe(IV) in the presence of protons (compound 3,
equivalent to compound Q of sMMO) (step IIa). This allows for the
epoxidation of pa-CoA and regeneration of the di-iron centre to com- 
pound 1 (step III), which prevalently occurs in the presence of pa-CoA 
and oxygen. 

Second, PaaABCE in the presence of O2, NADPH and ep-CoA 
(dashed purple line, Fig. 2a). Remarkably, the di-iron centre also deox- 
ygenates ep-CoA, which occurs when pa-CoA is largely used up. Once 
Figure 2 | PaaABCE is a bifunctional monooxygenase/deoxygenase.

a, NADPH consumption by PaaABCE (0.25 mg protein added per ml) was 
monitored photometrically in a 0.5-cm cuvette; maximum velocity (max. 
velocity indicated by thin line) was attained after addition of pa-CoA (+ 0.1 
mM phenylacetyl-CoA). NADPH consumption slowed and finally stopped 
completely when oxygen was depleted. NADPH consumption was restarted 
once the cuvette was shaken to introduce oxygen again. A change in absorbance 
at 365 nm wavelength (ΔA365nm) of 0.175 is equivalent to consumption of
0.1 mM NADPH (ε365nm of 3,500 M⁻¹ cm⁻¹). Samples 1 to 6 were taken at
different time points; their analysis by HPLC is shown in b. Curve sections are 
illustrated by differently coloured dotted or continuous lines. The 
correspondingly coloured roman numerals indicate the reactions taking place 
(shown in Fig. 3). b, RP-HPLC analysis of samples from a. Product peaks were 
analysed by mass spectrometry. CoA (observed) and 2-hydroxyphenylacetate 
(not visible, see Supplementary Fig. 8 for proof) are released during 
spontaneous decay during which no other CoA-esters were observed. 
Compounds are highlighted with shadings in yellow (CoA), purple (ep-CoA), 
and green (pa-CoA). 
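
The absorbance-to-concentration conversion quoted in this legend follows from the Beer-Lambert law. A minimal sketch of that arithmetic is given below; the function name and defaults are illustrative assumptions, while the extinction coefficient (3,500 M⁻¹ cm⁻¹ at 365 nm) and the 0.5-cm path length are taken from the legend.

```python
def nadph_consumed_mM(delta_a, epsilon_per_M_cm=3500.0, path_cm=0.5):
    """Convert an absorbance decrease at 365 nm into consumed NADPH (mM).

    Beer-Lambert: delta_A = epsilon * concentration * path length.
    The function name and default arguments are illustrative only.
    """
    molar = delta_a / (epsilon_per_M_cm * path_cm)  # concentration in mol per litre
    return molar * 1000.0                           # convert M to mM


if __name__ == "__main__":
    # Absorbance change quoted in the legend for a 0.5-cm cuvette.
    print(round(nadph_consumed_mM(0.175), 3))
```

With a 1.0-cm cuvette the same NADPH consumption would give twice the absorbance change, which is why the path length has to be stated alongside each trace.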


ep-CoA has bound, compound 1 is NADPH-dependently reduced to 
compound 2 with release of water as described above. Crucially, 
instead of reacting with O2, the reduced di-iron centre may abstract
the epoxy-oxygen from ep-CoA with generation of pa-CoA and resting
state compound 1 (step IIb). Any formed pa-CoA will be converted
back to ep-CoA by steps I, IIa and III. Hence, 2 NADPH are net
consumed per 1 O2 with formation of 2 H2O, whereas pa-CoA and
ep-CoA equilibrate. As observed in our HPLC assays, the reaction rates 
favoured the ep-CoA side under oxic conditions, implying somewhat
slower (40%, see below) deoxygenation compared to oxygenation
(100%) (Supplementary Table 1; sample 3, Fig. 2b). This conclusion
is consistent with distinct decreases in NADPH and O2 consumption
rates when pa-CoA was largely used up (Fig. 2a, Supplementary Fig. 7, 
Supplementary Table 2). 

Third, PaaABCE in the absence of O2 and in the presence of ep-CoA
and NADPH (dashed blue line, Fig. 2a). When O2 is depleted, only
NADPH-driven deoxygenation occurs (steps I and IIb), eventually
resulting in complete reduction of ep-CoA-derived oxygen to water
via step I. Hence, 1 ep-CoA is converted to 1 pa-CoA under NADPH-consuming
formation of 1 H2O until the ep-CoA pool is depleted
(sample 4 and 5, Fig. 2b). In total, 1 NADPH is oxidized for each 
initially produced ep-CoA. 

According to our model, PaaABCE ultimately reduces all oxygen to 
water via step I (Fig. 3). Hence, 2 NADPH should be consumed for each 
O2 reduced to 2 H2O, independent of the amount of pa-CoA added. To
test this counterintuitive hypothesis, we started oxygen-limited (concentration
of O2 = 0.235 mM) photometric assays by adding 0.05, 0.1,
0.2 and 0.4 mM pa-CoA to the assay mixture. Indeed, a constant
stoichiometry of ~1.8 NADPH oxidized : 1 O2 added resulted. The
small deviation from the expected stoichiometry of 2:1 can be explained 
by spontaneous partial decay of ep-CoA sparing 1 NADPH for each 
decomposed ep-CoA. In contrast, when produced ep-CoA was 
enzymatically removed, only 1 NADPH was consumed per 1 pa-CoA 
added (Supplementary Table 3). Photometric assays were analysed 
based on our reaction model, and a specific activity of ~0.4 U mg⁻¹
was calculated for ep-CoA deoxygenation (~40% compared to pa-CoA
oxygenation). The apparent Km value of PaaABCE for ep-CoA
was 17 ± 2 µM.
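
The stoichiometric bookkeeping in this paragraph can be written out explicitly. The sketch below assumes only what the text states: 2 NADPH are oxidized per O2 reduced to water, and each spontaneously decayed ep-CoA traps one oxygen atom and so spares 1 NADPH. The function names are hypothetical, and the decay amount it returns is a back-of-envelope consequence of the model, not a measured value.

```python
O2_MM = 0.235  # dissolved O2 in air-saturated water at 30 °C, as given in the text


def expected_nadph_mM(o2_mM, decayed_ep_coa_mM=0.0):
    """NADPH oxidized once all O2 is consumed, according to the reaction model.

    2 NADPH per O2 reduced to 2 H2O, minus 1 NADPH for each ep-CoA that
    decayed spontaneously (its oxygen atom is never reduced to water).
    """
    return 2.0 * o2_mM - decayed_ep_coa_mM


def decay_implied_by_ratio(o2_mM, nadph_per_o2):
    """ep-CoA decay (mM) that would account for an observed NADPH:O2 ratio."""
    return (2.0 - nadph_per_o2) * o2_mM


if __name__ == "__main__":
    print(round(expected_nadph_mM(O2_MM), 3))            # no decay: ideal 2:1 case
    print(round(decay_implied_by_ratio(O2_MM, 1.8), 3))  # decay implied by ~1.8:1
```

Under these assumptions the expected NADPH consumption does not depend on how much pa-CoA was added, which is the counterintuitive prediction tested in the oxygen-limited assays.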
Figure 3 | Simplified scheme of proposed reactions catalysed by PaaABCE
and states of the catalytic di-iron centre. Di-iron compounds are proposed in
analogy to sMMO (see text). I, NADPH-dependent reduction of the ground
state di-iron core to diferrous compound 2 under formation of water. IIa, the
reduced state directly reacts with molecular oxygen and produces a high-valent 
intermediate (compound 3). IIb, Alternatively, the reduced state may abstract 
the epoxy-oxygen to produce pa-CoA and compound 1. III, compound 3 ortho- 
oxygenates pa-CoA to produce ep-CoA and compound 1. Note that high- 
valent compound 3 of sMMO has not yet been unambiguously identified 
(indicated with double-headed arrow). Green arrows indicate pa-CoA 
formation, purple arrows ep-CoA formation. Pa-CoA and ep-CoA are shown 
in green and purple boxes, respectively. 
At first glance, deoxygenation of the epoxide may be incidental and
counterproductive; still, this bifunctionality may also imply biological 
aims, as suggested by the high affinity for ep-CoA. But why must the 
cells prevent a rise in concentration of ep-CoA? First, the labile epoxide 
rapidly decays to unproductive 2-hydroxyphenylacetate (Supplemen- 
tary Fig. 8) that cannot be used’. Second, epoxides are notoriously 
reactive towards DNA and protein” and severely hamper cell growth”. 
Normally, hydrolytic ring-cleavage by PaaZ ultimately removes ep- 
CoA. However, we found that a product (2-hydroxycyclohepta- 
1,4,6-triene-1-carboxyl-CoA, see Fig. 1) that is rapidly formed when 
PaaZ function is impaired by NADP+ shortage is a potent ring-cleavage
inhibitor (Supplementary Fig. 11). Note that PaaZ specifically
requires NADP+ (apparent Km of 17 ± 3 µM). Activity with NAD+
was <5%; previous reports showing activity with NAD+ were based
on absorption increase due to product rather than NADH formation
(see also discussion in Supplementary Information). In Escherichia
coli, the cellular NADP+ concentration was reported to be 2 µM, in
contrast to 120 µM NADPH, and cells maintain a high NADPH/
NADP+ ratio independent of the growth substrate used. As bacteria
inevitably encounter situations in which NADPH formation
exceeds its demand, short-term shortages of NADP+ may well trigger
ep-CoA accumulation (Fig. 1). We hypothesize that in this case,
pa-CoA and ep-CoA both compete for PaaABCE, which catalyses an
ongoing NADP+-restoring oxygenation/deoxygenation cycle that
prevents indefinite epoxide rise. It is tempting to speculate that certain 
pathogens like Burkholderia cenocepacia found alternatives for epoxide 
disposal, for example, as part of the pathogen’s arsenal against host cells, 
which may explain increasing evidence for PaaABCE as a virulence 
factor!®"’. 

The detoxification process, however, proceeds more efficiently if the 
concentration of the competing substrate pa-CoA does not rise in consequence
of ep-CoA deoxygenation. We speculate that thioesterase PaaI,
which exhibits high affinity for pa-CoA (apparent Km ≈ 10 µM),
removes excess pa-CoA. To test this, we compared relevant pathway
intermediates as substrates for PaaI (among others, see Supplementary
Table 4) and found an atypically narrow substrate spectrum with
reasonable activity exclusively for pa-CoA (1.6 ± 0.1 U mg⁻¹; 22 °C).
PaaI may thus act as a pa-CoA thioesterase that supports PaaABCE in
controlling ep-CoA concentration (Fig. 1). Aptly, we found that PaaY 
constitutes a second, previously undescribed type of thioesterase, which 
specifically hydrolyses inhibitory 2-hydroxycyclohepta-1,4,6-triene-1-
carboxyl-CoA (7.6 ± 0.6 U mg⁻¹, apparent Km 35 ± 15 µM; 22 °C)
(Fig. 1, see Supplementary Table 4 for substrate specificity) to recover 
CoA and counteract ring-cleavage inhibition (Supplementary Fig. 11). 
Hence, PaaY may relieve epoxide accumulation and have a stimulatory 
effect on intracellular phenylacetate consumption. As expected, both 
thioesterases seem induced under growth on phenylacetate (Sup- 
plementary Table 4). We note that some organisms found alternatives 
for 2-hydroxycyclohepta-1,4,6-triene-1-carboxyl-CoA disposal and 
may use it as a starting material for the synthesis of ω-cycloheptyl fatty
acids and distinct antibiotics (Supplementary Fig. 1). 

Although the versatile di-iron proteins catalyse diverse reactions 
with complex underlying chemistry, such as ribonucleotide reduction, 
methane monooxygenation or Δ9-desaturation of stearoyl-acyl carrier
protein, they are commonly monofunctional, and none is able to
recycle its substrate by using the primary product in an alternative 
reaction. Therefore, PaaABCE is outstanding among the di-iron 
proteins. The deoxygenation reaction is, from a thermodynamical 
perspective, enabled by the reactive epoxide that is significantly higher 
in energy than, for example, the phenolic alcohol that would arise from 
ring-hydroxylation. Of particular interest for further studies of 
PaaABCE are the di-iron species responsible for the key oxygenating 
and deoxygenation reactions. Apart from the still not clearly resolved 
di-iron compounds of sMMO, there are numerous other possibilities 
for reactive di-iron species, as reports of artificial biomimetic com- 
plexes indicate**”’. 
In summary, bifunctional PaaABCE acts as a phenylacetyl-CoA ring-
1,2-epoxidase/ring-1,2-epoxyphenylacetyl-CoA deoxygenase that may 
harbour an effective intrinsic safety mechanism allowing for undelayed 
restraint of epoxide formation. Thioesterases PaaI and PaaY assist
detoxification and prevent depletion of the CoA-pool. Recruitment of 
two separate thioesterases with an uncommon narrowed substrate spec- 
trum for ring-closed CoA esters might reflect the need to circumvent 
release of reactive ring-epoxides from ep-CoA. It seems that usage of 
labile and toxic intermediates in this biological system imposes elaborate 
escape mechanisms on the cells under stress conditions, allowing for 
recycling of phenylacetate that can be catabolized again once the distress 
eases. The intriguing question remains whether in other cases mono- 
oxygenases from the di-iron family (for example, alkene epoxidase”®, 
benzoyl-CoA epoxidase’®) similarly exploit the highly energetic epox- 
ides in analogous reactions and cooperate with other enzymes. 
Deoxygenation may then be a general means to control epoxide levels. 


METHODS SUMMARY 


Recombinant maltose-binding protein (mbp)-tagged Paa(mbp)ABCE from
Pseudomonas sp. strain Y2 and Paa(mbp)I and Paa(mbp)Y from E. coli K12 were
heterologously expressed in E. coli and purified by amylose resin chromatography. 
Pa-CoA was chemically synthesized’; all other pathway intermediates were 
enzymatically synthesized and purified by reversed-phase (RP)-HPLC. 

For detection of metals, recombinant PaaABCE was analysed using inductively 
coupled plasma emission spectroscopy. PaaABCE was denatured to isolate the 
FAD cofactor which was identified by RP-HPLC. The acid-labile sulphur content 
was determined by photometrical quantification of methylene blue formed 
through reaction of released sulphide from denatured protein and
N,N’-dimethyl-p-phenylenediamine, using Na2S·9H2O as the standard. Enzyme
composition was determined by quantifying the spot intensities for each subunit on
SDS gels. Enzymatic activities and apparent Km values for PaaABCE were
determined at 30 °C by following NADPH consumption in photometric assays at
365 nm or oxygen consumption using an oxygen optode. PaaI and PaaY activities
and apparent Km values were determined at 22 °C using RP-HPLC assays with
optical detection of CoA and CoA-esters that were separated with a C18-E column 
using an ammonium acetate-buffered (pH 6.8) acetonitrile gradient. 

For mass spectrometry (MS) measurements, RP-HPLC samples were directly 
injected into the TurboIonSpray ion source of an API 2000 LC/MS/MS mass
spectrometer. EPR spectra of PaaABCE were recorded with a Bruker EMX 1/6 
spectrometer operating at X-band (9.4 GHz). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Materials. ¹⁴C-phenylacetate was obtained from Biotrend Chemikalien GmbH.
All other materials were obtained from previously described sources’. 
Syntheses. Phenylacetyl-CoA was chemically synthesized as described before’. 
Pathway CoA-ester intermediates were enzymatically synthesized and purified 
in large scale reactions. For production of the derivative 2-hydroxycyclohepta- 
1,4,6-triene-1-carboxyl-CoA, a reaction mixture (4 ml) containing 0.06 mg PaaZ- 
E256Q (variant enzyme lacking aldehyde dehydrogenase activity), 0.3mg 
PaaABCE, 0.03mg PaaG, 1.55mM NADPH, 50mM Tris-HCl (pH 8.0) and 
0.6 mM pa-CoA was stirred for 7 min at 30°C. For other pathway intermediates, 
similar assays were performed using PaaABCE, PaaG, PaaZ and PaaJ as required. 
For ¹⁶O-ep-CoA, a reaction mixture (10 ml) containing 2.3 mg PaaABCE, 1.5 mM
NADPH, 50 mM Tris-HCl (pH 8.0) and 0.7 mM pa-CoA was stirred for 6 min at
22 °C. For ¹⁸O-ep-CoA, a reaction tube (volume 7.5 ml) containing a 3 ml reaction
mixture of 0.3 mM pa-CoA, 1 mM NADPH and 50 mM Tris-HCl (pH 8.0) was
closed and degassed. A total volume of 6 ml ¹⁸O2 gas was subsequently injected and
the assay stirred for 2 min. The reaction was started by addition of 0.05 mg
degassed PaaABCE and stirred for an additional 3 min at 22 °C. After incubation,
all reactions were stopped, the CoA-esters were purified as described before and
kept frozen (−70 °C). The CoA-esters were dissolved in water or 20 mM
2-(N-morpholino)ethanesulphonic acid (MES)-KOH pH 6.0 (ep-CoA, oxepin-CoA)
before use. Other CoA esters were chemically synthesized or purchased as 
described before®. CoA thioesters were quantified as previously described’. 
RP-HPLC analysis and CoA-ester and radioactivity detection. Detection and 
separation of CoA-esters in different samples were carried out by RP-HPLC as 
described before. The products of the conversion of [U-¹⁴C]phenylacetate were
analysed by flowthrough solid state scintillation counting of ¹⁴C, as described
previously. The radioactive samples were separated using a Luna 5U C18(2)
100A column (250 × 4.60 mm, 5 µm) (Phenomenex). The column was developed
at a flow rate of 1 ml min⁻¹ by a gradient from 2% acetonitrile in 40 mM phosphate
buffer (pH 4.0) to 70% acetonitrile within 32 min. Buffer was gradually exchanged 
for water during the gradient. 

Molecular biological techniques. All in silico cloning steps were performed with 
the program Clone Manager 7.11 (Scientific & Educational Software). The cloning 
procedures for paaABCDE, paaD, paaJ and paaG from Pseudomonas sp. strain Y2, 
and paaZ and mutated paaZ (encoding PaaZ-E256Q) from E. coli K12, and paaK 
from Thermus thermophilus were described before'***. Note that PaaD separated 
during purification of PaaABCDE. Chromosomal DNA was extracted using 
standard techniques. Genes for paaY from E. coli K12 were amplified using forward 
primer 5'-GGCGAATTCCCAATTTATCAGATAGACG-3’ and reverse primer 
5'-GCTAAGCTTTAACTCTGCTTTATAATTAAAGG-3’. Genes for paaI from
E. coli K12 were amplified using forward primer
5'-CTGAAGAGCTTCTATGAGTCATAAGGCCTGGC-3’ and reverse primer
5’-CCGTCTAGAATAAAGGCTTCACGCATCAG-3’. Introduced restriction sites are underlined; the forward
primers contained EcoRI (paaY) or XmnI (paaI) restriction sites, the reverse
primers HindIII (paaY) or XbaI (paaI) restriction sites. PCR was performed with
a mixture (5:1) of Taq polymerase and Pfu polymerase (Genaxxon) for 30 cycles,
including denaturation for 45 s at 95 °C, annealing for 45 s at 65 °C (paaI) or at
55 °C (paaY), polymerization for 1 min at 72 °C and a final extension at 72 °C for
5min. The PCR products were isolated and cloned into the expression vector 
pMAL-c2X (New England Biolabs), which adds a maltose-binding protein (mbp) 
to the amino terminus of the expressed proteins. 
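
Because the underlining that marked the introduced restriction sites is not preserved in this text, the sites can be located in the primer sequences directly. The following is a minimal sketch, assuming the standard recognition sequences of the four enzymes named above (N stands for any base); the dictionary layout and function name are illustrative and not part of the original protocol.

```python
import re

# Standard recognition sequences; "N" means any base.
SITES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "XmnI": "GAANNNNTTC",
    "XbaI": "TCTAGA",
}

# Primer sequences (5' to 3') as given in the Methods, keyed by (gene, direction).
PRIMERS = {
    ("paaY", "forward"): ("EcoRI", "GGCGAATTCCCAATTTATCAGATAGACG"),
    ("paaY", "reverse"): ("HindIII", "GCTAAGCTTTAACTCTGCTTTATAATTAAAGG"),
    ("paaI", "forward"): ("XmnI", "CTGAAGAGCTTCTATGAGTCATAAGGCCTGGC"),
    ("paaI", "reverse"): ("XbaI", "CCGTCTAGAATAAAGGCTTCACGCATCAG"),
}


def find_site(primer, recognition):
    """Return the 0-based start of the first recognition-site match, or -1."""
    pattern = recognition.replace("N", "[ACGT]")
    match = re.search(pattern, primer)
    return match.start() if match else -1


if __name__ == "__main__":
    for (gene, direction), (enzyme, seq) in PRIMERS.items():
        print(gene, direction, enzyme, find_site(seq, SITES[enzyme]))
```

Each primer carries its site a few bases in from the 5' end, leaving a short overhang before the recognition sequence.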

Analysis by mass spectrometry (MS). Samples of RP-HPLC fractions were 
directly injected into the TurboIonSpray ion source of an API 2000 LC/MS/MS
mass spectrometer (Applied Biosystems). For online mass detection only the Q1 
quadrupole was used. Measurements covered the range of m/z from 800 to 1,000. 
Bacterial strains, growth conditions, and protein purification. PaaABCE, 
PaaG, PaaK, PaaZ and PaaZ-E256Q were produced, purified, and stored as 
described before. The production, purification and storage of the heterologously
produced proteins PaaI and PaaY were performed as described for PaaABCE.
Protein analysing methods. Enzyme fractions were analysed by SDS-12.5% 
polyacrylamide gel electrophoresis*’. Proteins were visualized by Coomassie 
brilliant blue R-250 staining. Protein concentrations were determined by the 
Bradford method** with bovine serum albumin as a standard. Bands of 
PaaABCE subunits were excised, and the peptides digested and analysed by LC- 
MS/MS as described before*’. The relative protein amounts of PaaA, B, C and E of 
different enzyme preparations were calculated by determining the spot intensities 
of each subunit on an SDS gel using a ChemiDoc XRS Universal Hood II and
Quantity One software (Version 4.6.3) (both Bio-Rad). To determine the
stoichiometric composition of the purified complex, the spot intensity was corre- 
lated with the molecular mass of each protein (subunit A was corrected for the size 
of untagged protein). For iron and cofactor analysis, the absolute amounts of PaaA 
and PaaE within the respective sample were then calculated by means of Bradford 
protein quantification and correlated with the determined amount of iron, acid-
labile sulphur, and FAD. 

Enzyme measurements. Enzymatic reactions were carried out at 30°C 
(PaaABCE, PaaZ) or 22 °C (PaaI, PaaY) and were buffered with 50 mM Tris-HCl
(pH 8.0) if not stated otherwise. At least two independent measurements were
performed. The reactions were stopped with 1% formic acid resulting in a pH drop
to pH 4.5-5 (stronger acidification rapidly destroyed ep-CoA), and denatured
protein was subsequently removed by centrifugation. For PaaI and PaaY, the
activities were corrected for the size of untagged protein. The apparent Km value
of PaaZ for NADP+ was determined in photometric tests using variable amounts
of NADP+ (0.025-1 mM) and constant amounts of oxepin-CoA (0.2 mM) in the
presence of excess PaaY. 

A spectrophotometric assay was generally used for the measurement of 
PaaABCE activity, in which the oxidation of NADPH (or NADH for cofactor 
specificity) was followed spectrophotometrically at 365 nm (ε365nm for NADPH of
3,500 M⁻¹ cm⁻¹ and for NADH of 3,400 M⁻¹ cm⁻¹) in 1.0-cm or 0.5-cm cuvettes.
Occasionally, separately produced PaaD was added to test for possible involve- 
ment in catalysis. To test benzoyl-CoA as a non-physiological substrate for 
PaaABCE (Supplementary Fig. 6), NADPH oxidation was monitored in a 
photometric test (300 µl) containing 0.12 mg PaaABCE and 0.5 mM NADPH.
The reaction was started with 0.05-0.3 mM benzoyl-CoA. After NADPH oxida- 
tion nearly halted, 0.2 mM pa-CoA was added, followed by subsequent addition of 
fresh PaaABCE (0.1 mg ml⁻¹). NADPH consumption by PaaABCE was also measured
photometrically in the absence of substrate. For this purpose, a reaction
mixture (0.2 ml) containing 1 mM NADPH and 0.4 mg PaaABCE was monitored 
at 30°C. The cuvette was covered with paraffin oil to prevent oxygen diffusion and 
the assay recorded until NADPH consumption stopped due to lack of oxygen. 
Approximately 2 NADPH were consumed per O2. Also, addition of 0.1 mg catalase
had no effect on NADPH consumption, indicating that O2 was reduced to
water rather than to H2O2.

The NADPH consumption was determined for varying pa-CoA concentra- 
tions of 0.05, 0.1, 0.2 and 0.4mM in air-saturated water at 30°C (0.235 mM 
dissolved O2) (Supplementary Table 3). For this purpose, photometric tests
(0.8 ml) containing 0.2 mg PaaABCE and 1 mM NADPH were monitored until 
NADPH oxidation stopped. The cuvettes were covered with paraffin oil. 
Decomposed ep-CoA would cause somewhat lower oxidation of NADPH due 
to trapped oxygen in 2-hydroxyphenylacetate according to the reaction model in 
Fig. 3. Hence, HPLC samples were withdrawn and the amount of decomposed 
ep-CoA was calculated by quantifying the peak area of released CoA. The values 
were then used to correct the expected amounts of oxidized NADPH. 
Furthermore, to determine the pa-CoA:NADPH stoichiometry when ep-CoA 
is removed through subsequent steps, PaaG and PaaZ-E256Q mutant enzyme 
lacking aldehyde dehydrogenase function (instead of PaaZ to avoid recycling of 
NADPH) were added to photometric tests. The presence of PaaY assured that 
arising 2-hydroxycyclohepta-1,4,6-triene-1-carboxyl-CoA (which absorbs at 
365 nm) was removed. 

To determine the pH optimum of PaaABCE, reaction mixtures at 22 °C were 
buffered either with 100 mM MES-KOH (for pH values 6.5-7.0) or 100 mM Tris- 
HCl (for pH values 7.5-9.0) and were started by addition of 0.1mM pa-CoA. 
NADPH oxidation was followed spectrophotometrically and the specific activities 
were compared between pH 6.5 to 9 in 0.5 pH steps. 

The apparent Km values for pa-CoA, ep-CoA and NADPH were determined in
photometric tests. For pa-CoA, variable amounts of pa-CoA (5-100 µM) were
added to a reaction mixture containing saturating NADPH (130 µM) and O2
(0.235 mM). The Km value for NADPH was determined by adding variable
amounts of NADPH (10-250 µM) to the assay, while the pa-CoA concentration
was kept constant (100 µM). The Km value for ep-CoA was indirectly determined
by adding variable amounts of pa-CoA (2.5-100 µM) to assays with constant
NADPH concentrations (0.5 mM). The NADPH decrease eventually slowed down
and adopted a second constant rate when pa-CoA was largely converted to
ep-CoA. The velocity of this second rate was analysed for different amounts of
produced ep-CoA, and the apparent Km value was calculated. The apparent Km
value for O2 was determined by observing the oxygen consumption in an
enzymatic test (0.5 ml), using a MICROX TX3 fibre-optic oxygen meter (‘oxygen
optode’) (PreSens GmbH), which was calibrated at 30 °C with air-saturated water
(100% corresponding to 0.235 mM O2) and freshly prepared 60 mM dithionite
(0%). The reaction mixture contained 0.28 mg PaaABCE, 0.03 mg PaaG, 0.1 mg
PaaZ, 0.7 mM pa-CoA, 1 mM NADPH and 1 mM NADP+. The apparent Km
value was graphically determined and corresponded to the curve point at which 
half-maximal velocity was attained. 
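
As an illustration of how an apparent Km can be extracted from rate data of this kind, the sketch below applies a Hanes-Woolf linearization of the Michaelis-Menten equation. This is a swapped-in textbook technique chosen for its simplicity, not the procedure used here (values were read graphically or calculated with GraphPad Prism); the data points are synthetic, generated from an assumed Km of 17 µM and Vmax of 1.0 purely to show that the fit recovers them.

```python
def hanes_woolf_fit(substrate, velocity):
    """Estimate (Km, Vmax) from a least-squares line through ([S], [S]/v).

    Michaelis-Menten rearranged: [S]/v = [S]/Vmax + Km/Vmax,
    so slope = 1/Vmax and intercept = Km/Vmax.
    """
    x = list(substrate)
    y = [s / v for s, v in zip(substrate, velocity)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return km, vmax


if __name__ == "__main__":
    # Synthetic, noise-free example data (hypothetical, not measured values).
    km_true, vmax_true = 17.0, 1.0
    s_values = [2.5, 5.0, 10.0, 25.0, 50.0, 100.0]           # µM
    v_values = [vmax_true * s / (km_true + s) for s in s_values]
    km, vmax = hanes_woolf_fit(s_values, v_values)
    print(round(km, 3), round(vmax, 3))
```

With real, noisy measurements a direct nonlinear fit of the Michaelis-Menten equation is generally preferred, since linearizations weight the data points unevenly.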

To test the effect of ep-CoA withdrawal on PaaABCE catalysed substrate (pa- 
CoA) consumption, subsequent enzymes PaaG and PaaZ that ultimately remove 
ep-CoA were added to the enzymatic assay. The assays contained 1 mM NADPH, 
1 mM NADP+, 0.5 mM pa-CoA, and either only 0.03 mg PaaABCE or 0.03 mg
PaaABCE plus 0.05 mg PaaG, and 0.3 mg PaaZ. 

To compare the velocities of oxygen and NADPH consumption (Supplementary
Table 2, Fig. 2, Supplementary Fig. 7), enzymatic assays (800 µl)
containing 1mM NADPH and 0.12 mg PaaABCE were started by addition of 
0.1mM pa-CoA at 30°C and were either monitored for NADPH decrease 
(photometrically) or for oxygen consumption (using a calibrated oxygen optode, 
see above). 

Also, oxygen consumption by PaaABCE was monitored under pa-CoA/ 
NADPH limitation with following RP-HPLC product analysis at different time 
points (Supplementary Fig. 9). The assay (1.3 ml) at 30°C contained 0.23 mg 
PaaABCE, 0.22 mM NADPH, and was started by addition of 0.1 mM pa-CoA. 
The reaction tube was covered with paraffin oil to prevent oxygen diffusion. After 
oxygen consumption stopped, 0.2mM NADPH was added. Different samples 
(1-4) were taken as indicated in Supplementary Fig. 9, immediately stopped, 
and analysed by RP-HPLC. 

Specificity of thioesterases PaaI and PaaY. To determine thioesterase activity of 
PaaI and PaaY (Supplementary Table 4), formation of CoA from different substrates
was quantified by determining the relative peak areas of substrates and
products at different time points in RP-HPLC assays at 22 °C. For PaaI activity,
assays (0.15 ml) containing 0.08 mg PaaI and 0.25-0.5 mM pa-CoA were incubated
for 2 min, stopped, and analysed. For PaaY activity, reaction mixtures
(0.1 ml) containing 641g PaaY and 0.5 mM 2-hydroxycyclohepta-1,4,6-
triene-1-carboxyl-CoA were incubated for 2 min, stopped, and analysed. Other
substrates (each 0.5 mM) were tested accordingly (see Supplementary Table 4). To
determine the apparent Km value of PaaY for 2-hydroxycyclohepta-1,4,6-triene-1-
carboxyl-CoA, varying amounts (0.02, 0.05, 0.2 and 1 mM) of it were added to
enzymatic tests (25-100 µl) containing small amounts of PaaY (0.6-1.2 µg). The
reactions were stopped after 2min and analysed by RP-HPLC. To test for 
induction of thioesterase activity during growth on phenylacetate, E. coli K12 cells 
were grown on phenylacetate as sole carbon and energy source as described 
previously’. Similarly, E. coli cells were grown on glucose as sole carbon and energy 
source complemented with vitamins and trace elements in phosphate-buffered 
mineral salt medium”. Cells were harvested during exponential growth phase and 
disrupted using a French pressure cell. After ultracentrifugation, the supernatant 
(cell extract) was used to determine thioesterase activity in HPLC assays (see
above) with pa-CoA or 2-hydroxycyclohepta-1,4,6-triene-1-carboxyl-CoA as 
substrates. 

Effect of PaaY on ring-cleavage. To test the effect of PaaY on ring-cleavage 
(Supplementary Fig. 11), a time course of oxepin-CoA (0.5 mM) consumption
and product formation by recombinant purified PaaZ (4.1 µg ml⁻¹) was recorded
using RP-HPLC either in the absence of NADPH and PaaY, in the presence of
NADP+ (1.5 mM) and the absence of PaaY, or in the presence of NADP+
(1.5 mM) and PaaY (44 µg ml⁻¹).

Separation of substrates and products of PaaABCE and MS analysis. To 
observe the enzymatic formation of ep-CoA and pa-CoA during a photometric 
test (1.5 ml), a reaction mixture containing 0.1 mM pa-CoA, 1 mM NADPH and 
0.25 mg ml⁻¹ PaaABCE was monitored at 365 nm. After NADPH oxidation
stopped, the cuvette was vigorously shaken to supply oxygen. Different samples 
were withdrawn (Fig. 2) and the samples analysed by RP-HPLC. CoA-ester peaks 
were collected for mass spectrometry analysis. 

¹⁸O labelling experiments. (Shown in Supplementary Fig. 10.) To determine the
PaaABCE-catalysed deoxygenation of ¹⁸O-ep-CoA and subsequent reformation
of ¹⁶O-ep-CoA in the absence of pa-CoA, a reaction mixture (620 µl) containing
0.12 mM ¹⁸O-ep-CoA and 1 mM NADPH was started by addition of 0.09 mg
PaaABCE and stirred at 22 °C. Samples were taken before reaction start and after
1, 2, 4 and 8 min. As a control, two samples after 2 and 4 min were taken from a
second assay without NADPH. Also, an assay (800 µl) containing 0.2 mg
PaaABCE and 1 mM NADPH was started by addition of 0.1 mM pa-CoA in
80% H2¹⁸O/20% H2¹⁶O. After conversion of pa-CoA to ep-CoA, deoxygenation/
reoxygenation was observed for an additional 2 min before a sample was taken in
which no ¹⁸O-ep-CoA was detected. Ep-CoA was purified by RP-HPLC and
frozen in liquid nitrogen until MS measurement. 

Detection of 2-hydroxyphenylacetate as decay product of ep-CoA. (Shown 
in Supplementary Fig. 8.) To show that 2-hydroxyphenylacetate arises from 
spontaneous ep-CoA decay, phenylacetate-CoA ligase PaaK (1 mg ml⁻¹) was used
to enzymatically synthesize ¹²C/¹⁴C-pa-CoA from a mixture of ¹²C/¹⁴C-phenylacetate.
A sample was analysed by HPLC using UV and radioactivity detection
before NADPH (0.5 mM) and PaaABCE (0.7 mg ml⁻¹) were added. The reaction
tube was incubated at 40°C and vigorously shaken to supply oxygen. Further 
samples were taken after 1, 3 and 10 min and analysed accordingly. 

Inhibition by EDTA. To test the effect of metal chelating agents, 0.17 mg 
PaaABCE were incubated with 50mM EDTA (pH 8.0) for 3h at 30°C. As a 


control, equal protein amounts were incubated without EDTA. Both fractions 
were then used for photometric activity tests as described above. 
Determination of the FAD cofactor. Purified PaaABCE (94 µg) was denatured
for 1 h in the dark with 3 µl of 70% HClO4 to liberate cofactors. The sample was
centrifuged (10,000g, 10 min, 4 °C) and 100 µl of the yellowish supernatant
containing the isolated cofactor were transferred to 83 µl of 1 M sodium
phosphate buffer, pH 7.1 and analysed by RP-HPLC using a C18-E column
(LiChrospher 100 RP, end-capped, 5 µm, 125 × 4 mm) and a flow rate of 1 ml
min⁻¹. The retention times of flavin adenine dinucleotide (FAD) and flavin
mononucleotide (FMN) under isocratic conditions in 9% acetonitrile and 
20 mM ammonium acetate (pH 6.0) were 5.5 and 8.5 min, respectively. A distinct 
peak eluted after 5.5 min retention time. The cofactor was identified as FAD by 
retention time, and the amount of cofactor extracted was determined by comparison
of the peak area with that of FAD standard solutions (0.05-10.0 µg FAD
per injection). 

Determination of acid-labile sulphur content. The acid-labile sulphur content 
of (dithioerythritol-free) PaaABCE was determined by denaturing PaaABCE in 
alkaline medium under release of sulphide, which was coprecipitated with 
Zn(OH)2 as ZnS. Acidification then produced H2S, which was condensed with
N,N’-dimethyl-p-phenylenediamine yielding methylene blue, as quantified
photometrically at 670 nm, using Na2S·9H2O as the standard.

Spectroscopic analysis of PaaA(mbp)BCE. (Shown in Supplementary Fig. 4.)
Ultraviolet-visible spectra against the blank buffer solution were recorded using
a Cary Bio 100 spectrophotometer (Varian). A spectrum of oxidized PaaA(mbp)BCE
(0.24 mg ml⁻¹) was monitored in the range 220-600 nm. To visualize FAD
cofactors and [2Fe-2S]-clusters, 5.5 mg ml⁻¹ PaaA(mbp)BCE were recorded in the
range 320-600 nm and then completely reduced by addition of 0.7 mM dithionite.
Reduced enzyme was then exposed to air to re-oxidize PaaA(mbp)BCE over time,
EPR measurements. (Shown in Supplementary Fig. 5.) EPR measurements were
conducted with a Bruker EMX 1/6 spectrometer operating at X-band (9.2 GHz). 
The sample temperature was controlled with an Oxford Instruments ESR-9 
Metal detection. For detection of metals, 7.1 mg of recombinant purified
PaaABCE was analysed by inductively coupled plasma emission spectroscopy 
(ICP-OES) as described before**. 

Computational analysis. (Shown in Supplementary Fig. 2.) Apparent Km and
Vmax values were calculated using GraphPad Prism4 software. The BLASTP
searches were performed via the NCBI BLAST server (http://www.ncbi.nlm. 
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completely sequenced archaeal genomes. The percentage of archaeal species, 
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evolutionary history was inferred using the Neighbour-Joining method’. The 
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conducted in MEGA4. 
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ENVIRONMENT 


Toxic effects 


Environmental concerns and more stringent laws are 
providing opportunities for environmental toxicologists. 


BY AMANDA MASCARELLI 


When oil began gushing into the Gulf 
of Mexico after BP’s Deepwater 
Horizon drilling rig exploded in 
April 2010, politicians and the public were 
braced for the worst. Oil lapped up on hundreds 
of kilometres of beach and coastal sands 
stretching from Florida to Texas, and threatened 
Louisiana's ecologically sensitive wetlands. 
Although the wetlands and shorelines 
faced substantial damage, these regions were 
spared the brunt of the disaster, the largest 
marine spill in US history. Instead, much of 
the oil and gas — and between 2 and 3 million 
litres of dispersant that BP used to help 
break up the oil — drifted through the sea in 
deep underwater plumes, exposing marine 
organisms, many of which were in the vulnerable 
reproductive stages of their life cycles, to 
pollutants for several months. 

Many scientists were alarmed by the 
unprecedented use of dispersants in the deep 
ocean. Little was known about how the dis- 
persants and oil would interact, how toxic 
they were to many marine organisms, how 
long they would persist in the deep-water 
environment at potentially damaging concentrations 
and what effect this would have 
on the growth and reproduction of the Gulf 
of Mexico's myriad organisms. 

Toxicologists working for government 
agencies, local universities and private con- 
sultants — along with fisheries biologists, 
oceanographers and ecologists — set to work 
to find the answer. “It’s like CSI meets [Jacques] 
Cousteau,” says Susan Shaw, a marine toxicolo- 
gist and director of the Marine Environmen- 
tal Research Institute, a non-profit institution 
based in Blue Hill, Maine. 


During the gulf spill, which took nearly 
three months to contain, Shaw helped to 
analyse the effects of the oil and dispersants 
on marine animals. Later she took part in a 
scientific working group, assembled by the 
US Department of the Interior, that was tasked 
with assessing the effects of the spill on fisher- 
ies, wildlife and human health, and with mak- 
ing policy recommendations. This real-world 
application of her work is what drew Shaw to 
environmental toxicology 30 years ago. She 
still finds it deeply gratifying. 

Environmental toxicologists aim to under- 
stand the effects of chemicals and physical 
agents — such as dust, mould, cigarette smoke, 
vehicle exhaust, pesticides, radiation and heavy 
metals — on both human health and ecosys- 
tems. Some work in risk assessment, relying 
heavily on statistical computations and mathematical 
modelling to determine whether an 
agent is likely to be hazardous at a certain level 
of exposure. Others are mainly chemists or 
biologists who tackle issues such as how pol- 
lutants travel and disperse in air and water, and 
how these agents affect humans, animals and 
ecosystems. “It takes the team of those people 
to put the whole picture together,” says Jennifer 
Field, an environmental chemist at Oregon 
State University in Corvallis, who traces 
pharmaceutical products and illicit drugs in 
wastewater systems as an alternative indicator 
of drug-use trends. 

In the 1970s and 1980s, the field of environ- 
mental toxicology expanded rapidly, especially 
in the United States, triggered mainly by the 
1962 publication of Rachel Carson’s Silent 
Spring — an exposé of the path that pollut- 
ants, particularly pesticides, take through the 
environment — and by the establishment of 
the US Clean Water Act and an array of air- 
quality laws. Although the field has not been 
immune to the economic depression of the 
past few years, toxicologists are in demand 
in government, academia, private industry 
and consultancies across the United States 
and internationally. In the United States, toxicologists 
held around 117,000 jobs in 2010, up 
3.5% from 2008, according to an analysis of 
US Bureau of Labor Statistics data by members of 
the Society of Toxicology in Reston, Virginia, 
an organization representing scientists from 
academia, business and government. The analysis, 
presented as a poster at the society’s 2011 Annual 
Meeting, predicts that the number 
of US toxicologists will grow to some 153,000 
by 2018. 

Beijing’s Forbidden City surrounded by smog at midday. 

New legislation could fuel a greater 
demand. In the European Union, for example, 
the Registration, Evaluation, Authorisation 
and Restriction of Chemical Substances 
(REACH) law took effect in June 2007. Phased 
in over 11 years, REACH aims to create a 
standardized, rigorous approach to testing and 
evaluating chemicals before they reach the 
market. The law will create jobs for toxicolo- 
gists across all sectors, including academia, 
government, business and consulting firms 
that will require a blend of technical and managerial 
skills, says Mark Crane, an environmental toxicologist 
and founder of WCA Environment, an 
environmental consultancy based in Faringdon, UK. They will 
be needed to help to find efficient and effective 
ways to meet the often complex requirements 
of REACH, says Crane. Knowledge of how the 
chemical industry and its supply chains work 
will be a big advantage. “You need to be able to 
speak knowledgeably to a client, to give them 
strategic as well as tactical advice,” he says. 
For instance, to help a client meet REACH 
mandates, it is important to be able to under- 
stand the company’s specific needs, such as 
whether the regulations require animal testing — 
which can be a lengthy and costly process — 
or if there is a more efficient method 
to satisfy the regulations. 

In fact, there may not be enough well- 
trained toxicologists to fulfil the demands of 
REACH in the public and private sector in 
Europe in the coming years, says Alan Boobis, 
a toxicologist at Imperial College London. 
Others have suggested the same, includ- 
ing in a 2007 European Commission report 
(see go.nature.com/zbtkdi), which noted 
the lack of toxicology training programmes. 
The problem, says Boobis, is that toxicologists 
with the integrated training required for 
risk assessment are close to retirement. Furthermore, 
many universities in Europe offer 
programmes that do not provide the broader 
scope of training needed for the applied sci- 
ence of toxicology. “With greater specializa- 
tion, we are losing this expertise in Europe, 
and without an adequate career track there is 
little incentive to pursue such a career.” 

There are, however, some notable excep- 
tions, including Britain’s Cranfield Univer- 
sity, which provides tailored postgraduate 
programmes that allow students to combine 
policy and applied work, says Crane. He also 
points to the University of Insubria in Varese, 
Italy, which offers a doctoral programme in 
environmental chemistry and ecotoxicology. 
The Netherlands has a handful of universi- 
ties that offer good toxicology programmes, 
including Wageningen University and Utrecht 
University. 

Toxicologists generally acquire expertise 
in one or two areas, focusing on, for instance, 
mathematical modelling and biostatistics, epi- 
demiology, molecular biology, environmental 
chemistry or pharmacology. Most toxicology 
jobs require PhD training, although there are 
opportunities for master’s students in govern- 
ment and elsewhere. Some of the most impor- 
tant skills that graduates entering the field can 
bring are social: understanding their clients’ 
needs and collaborating with other experts. 
“The people who are going to be successful are 
people who have that deep expertise in their 
own field yet have the professional capabilities 
to collaborate, and to open their mind and to 
really listen,” says Dave Stone, director of the 
US National Pesticide Information Center, a 
cooperative between the Oregon State Uni- 
versity and the US Environmental Protection 
Agency in Washington DC. 

The United States has at least 125 toxicology 
programmes, some of which concentrate 
on a specific subfield such as ecotoxicology, 
inhalation toxicology, metals toxicology or 
policy. Indiana University in Indianapolis, 
for example, is a centre of ecotoxicology and 
chemical carcinogenesis; Michigan State 
University in East Lansing focuses on food 
toxicology; and The University of Rochester 
in New York specializes in the effects of 
toxic chemicals on bones. George Gray, a risk 
analyst at George Washington University in 
Washington DC, says that the university's pro- 
gramme emphasizes policy, producing com- 
petitive graduates. “It makes them attractive to 
organizations when they understand not just 
the science but how it ends up being used,” 
says Gray. A great toxicology programme will 
“allow students to work with professors in a 
laboratory setting, have the opportunity to 
contribute to publications as co-authors, and 
take a wide range of courses in toxicology, 
pharmacology, chemistry and ecotoxicology,” 
says Margaret Whittaker, managing director 
and chief toxicologist at ToxServices, a con- 
sulting firm based in Washington DC that 
assesses health risks from chemicals. 




REAL-WORLD PROBLEMS 
In Europe, REACH puts the burden of proof- 
of-safety for chemicals on the chemical indus- 
try itself; this is not the case in the United 
States, where advocates have been fighting for 
an overhaul of existing US legislation for years. 
The US Toxic Substances Control Act of 1976 
is widely viewed as outdated and fractured, 
even by members of the chemical industry. 

“The US law currently takes the view that 
chemicals are innocent until proven guilty 
and REACH takes the opposite view — that 
chemicals are suspect unless the companies 
making them and using them can prove that 
they're safe,” says Monica Becker, a sustainability 
consultant based in Rochester, New York. If 
the burden of proof were to shift, says Becker, 
more chemical testing and expert interpreta- 
tion of toxicity data would be required, which 
would create a demand for toxicologists. But 
California is already preparing to implement 
regulations that will require consumer prod- 
uct manufacturers to investigate whether their 
products contain ‘chemicals of concern’, and 
suggest substitutes. Whittaker expects these 
regulations to create many private-sector jobs 
for toxicologists. 

MICHAEL NEWMAN 
Erica Holloman is hoping to make a difference. 

Legislation adopted in China in 2010, known 
as China REACH, is similar to that in Europe 
and will drive the country’s demand for toxicologists, 
says Kenneth Leung, an aquatic 
toxicologist at the University of Hong Kong. 
He says that the number of industry and aca- 
demic postgraduate and postdoctoral jobs in 
the field is growing and this is likely to con- 
tinue as China attempts to meet demands for 
efficiency, cleaner air and water, and action 
on addressing climate change. 

“North America confronted decades ago 
the problems now emerging in Asia, so indi- 
viduals trained in environmental toxicology 
can make a substantial contribution in Asia,” 
says Michael Newman, an ecotoxicologist 
at the Virginia Institute of Marine Science 
in Gloucester Point. China has a num- 
ber of well-known training programmes, 
including a programme at the Guangzhou 
Institute of Geochemistry of the Chinese 
Academy of Sciences, known for its moni- 
toring of environmental pollution, and the 
Research Center for Eco-Environmental 
Sciences, also part of the Chinese Acad- 
emy of Sciences in Beijing, which is work- 
ing to develop an early-warning system for 
environmental contamination. 

Globally, the field is moving away 
from a focus on individual chemicals 
and towards understanding the cocktail 
of chemicals that pervade the environ- 
ment, says Linda Birnbaum, head of both 
the National Institute of Environmen- 
tal Health Sciences in Research Triangle 
Park, North Carolina and the US National 
Toxicology Program, a federal interagency 
programme. “People are finally beginning 
to realize that nobody is exposed to one 
chemical at a time,” says Birnbaum. “We 
live in a soup. We've got to go beyond look- 
ing at one chemical or one exposure at a 
time and start looking in a more integrated 
fashion.” In the future, toxicologists will 
need to consider cumulative effects of 
many exposures and work as part of mul- 
tidisciplinary research teams. 

Most environmental toxicologists say 
they are highly satisfied with their work, 
partly because they can apply their exper- 
tise to pressing, real-world problems. 
Erica Holloman, who recently completed 
her PhD in toxicology at the Virginia 
Institute of Marine Science, is building on 
her doctoral work, assessing the effects of 
mercury in seafood and other pollutants 
in a low-income community in Newport, 
Virginia, through a community grant from 
the US Environmental Protection Agency. 
“I wanted to see my work make a difference 
in everyday life,” says Holloman. “I found 
this was a way that my passion for science 
and the research could lead to something 
that was dear to my heart, and that I could 
really have an impact in.” 


Amanda Mascarelli is a freelance writer 
based in Denver, Colorado. 


TURNING POINT 


CAREERS 


Sean Bendall 


Sean Bendall, a postdoctoral fellow in stem-cell 
and cancer biology at Stanford University in 
Palo Alto, California, won the Dale F. Frey 
Award for Breakthrough Scientists from the 
Damon Runyon Cancer Research Foundation 
in New York on 9 January 2012. 


How did you decide where to do your PhD? 
After earning my bachelor’s degree in bio- 
chemistry and microbiology at the University 
of Victoria in Canada, I decided to work for 
a year at a proteomics facility while I applied 
to graduate programmes. I was accepted to 
some very prestigious places, including the 
University of Oxford, UK and the Institute for 
Systems Biology in Seattle, Washington. The 
decision was tough because the research pro- 
ject that excited me the most was not happen- 
ing at Oxford or in Seattle. I wanted to work on 
the mechanism that makes an embryonic stem 
cell continue to be a stem cell and stops it from 
differentiating into another type of cell, and I 
was able to do this at the University of Western 
Ontario in London, Canada. 


Was it a good decision? 

I definitely do not regret deciding to follow my 
passion. My stem-cell work ended up making 
a big splash in Nature (S. C. Bendall et al. 
Nature 448, 1015-1021; 2007), even though it 
took a while to get the paper out. I think if you 
could tell any PhD student that their project 
would culminate in a high-profile paper in a 
top journal, they would consider that the best- 
case scenario. As it was, I left little bits of my 
soul behind on a long journey to get to that 
end point. 


What did you learn while writing that paper? 
How to tell the story better. Like many aca- 
demics, I was blind when it came to my own 
research. I had to find a way to write that laid 
out how my observations could help to guide 
science in a new direction, rather than simply 
saying the research so far had been wrong. 


How many iterations did the paper undergo? 

Several. The first version was two stories — the 
method development and the actual stem-cell 
analyses — that were poorly tied together. 
We sent it to one journal, not Nature, and the 
reviews were not good. But the reviewers did 
suggest experiments that could strengthen the 
paper, which helped us to describe, in a new 
version, how some of the pathways that we had 
thought were most important in stem-cell cul- 
tures were not even acting on the stem cells, 
but on the support cells. Once I combined my 
work with a colleague's, a bigger story emerged 
of how growth factors help to control how 
human embryonic stem cells differentiate into, 
say, bone marrow or skin. 


Are there downsides to training at the cutting 
edge of science? 

There can be. In 2008, next-generation flow 
cytometers were so new that I was almost a year 
into my postdoc before we got the instrument 
in the laboratory. Until then, I had to send my 
samples to the manufacturer in Toronto. Still, 
once we got everything running well, we were 
able to quickly publish a paper that illustrated 
the technology’s capabilities and potential. 
Every project in our lab is now using this tech- 
nology. It is an exciting time. 


Has your ongoing success delayed your efforts 
to seek a permanent position? 

Yes. I haven’t applied for any jobs yet because 
everything is going so well, and I want to focus 
on the research. But I realize that I need to put 
myself on the job market and that now is prob- 
ably best because my work is going so well. 
Receiving the US$100,000 Dale F. Frey Award 
is a nice nest egg for when I get my own lab. 


What is your secret for success? 

Research topics in the top journals are prob- 
ably not the best PhD or postdoc topics. They 
already have enough people working on them. 
I like finding the empty space in research in 
which no one else is poking around. And my 
publication record is good partly because I work 
in multidisciplinary groups. Some researchers 
think that they should do everything on their 
own, but that can take so much longer, and a 
collaborative project might also result in a better 
interpretation of the data because the case has to 
be proved to many disciplines. 


SCIENCE FICTION 


THE PI THAT WASN'T ROUND 


How to square a circle. 


BY DAN ERLANSON 


“4.0!” Amisha shouted. The other students laughed. 
James gave them a look to keep 
them quiet. “Very funny, Amisha. Try again.” 

“But I'm serious, Mr Wong — look!” 

Amisha wasn't the brightest kid in fourth 
grade, but she was hard-working, and she 
certainly wasn't the class clown. James 
walked over to investigate. Along with 
everyone else, Amisha had used a compass 
to draw a circle on a piece of paper and 
then used a string and a ruler to measure 
its circumference and diameter. His class 
was celebrating March 14, Pi Day, and the 
students had been reading off their measurements. 
A ten-year-old and a piece of 
string can get you only so many significant 
figures, but Amisha was way outside the 
margin of error. James watched as Amisha 
remeasured her circle: 41.6 centimetres in 
circumference, 10.4 centimetres in diameter. 
The ratio expected for a square, not 
a circle. 

“Try drawing another,” James suggested. 
Amisha took a fresh piece of paper, carefully 
drew, and then measured her new circle: 
23.1 centimetres in circumference, 7.4 centimetres 
in diameter. “3.1?” Amisha asked, a 
little uncertain. She picked up the string and 
turned back to her original drawing. 

“Perfect!” James said. “I’ll keep this other 
one.” 

After classes ended, James closely studied 
Amisha’s drawing: just a plain sheet of paper 
with a circle filling up about a quarter of the 
page, slightly off-centre. He carefully measured 
it: 41.6 centimetres in circumference, 
10.4 centimetres in diameter: C/d=4.0. Did 
it feel heavier than a normal sheet of paper? 
Was there a certain luminescence to it? 

He took it down the hall to the staff room 
and made a photocopy. It looked identical, 
but James measured it again: 32.8 centimetres 
in circumference this time, still 
10.4 centimetres in diameter: C/d=3.15. 
He scanned the drawing, saved the image, 
printed it and measured again: 10.4 centimetres 
in diameter, 32.8 centimetres in circumference. 
Whatever was strange about 
Amisha’s circle, it couldn't be copied. 

All evening he thought about the non-circular 
circle. His aborted PhD thesis was 
on a hypothetical particle that could cause 
spatial distortions if it interacted with ordinary 
matter. Could one of these ‘strangelets’ 
have collided with Amisha’s paper? 

James needed someone to help him think 
this through. And although he dreaded 
reconnecting, he realized that the best per- 
son to turn to was his old adviser, Professor 
Myers. He waited until after dinner — and 
a stiff drink — before making the call. Of 
course Myers was still in his office. 

“It’s been a while I know, but I have something 
to show you. Something 
I think you'll find 
interesting.” 

There was a pause, 
then: “James? Gotten 
bored with the brats?” 

“Can I come over?” 

“Still remember how 
to find the department? 
Give me a call when 
you arrive and I'll buzz 
you in.” 

On his drive to the 
university James thought 
about the last conver- 
sation he had had with 
Myers. It was an early 
spring evening like this 
one, and James was clear- 
ing out his desk in the 
physics department. He 
had already told Myers 
about his decision to 
leave, and Myers had said little, but didn’t 
hide his disappointment. As James was tak- 
ing down the last of his photographs, he felt 
someone approach from behind. 

“So what are you going to do?” Myers 
asked. 

“I don't know; I was thinking of teaching 
elementary school.” 

“Teach!” Myers yelled. “Kids are even 
dumber than graduate students!” Then, in a 
lower voice: “You were the brightest student 
in your class — why do you want to throw 
that away?” 

Because I was becoming too much like you, 
a grasping egotist with no interest in anything 
or anyone except his own reputation, James 
thought furiously, but he replied simply: 
“I’ve gotta go.” 

James had continued reading about 
physics in the years since. The work he had 
been doing remained untested and seemingly 
untestable, and Myers sank into deeper 
obscurity, even among physicists, although he 
apparently remained as 
abrasive as ever. James had not been the first 
student to leave without graduating, and he 
certainly wasn't the last. 


The building still had the same faint electri- 
cal odour that James remembered. Myers’ 
office door was open as usual so he could 
keep track of his group. You quickly learned 
to arrive early or leave late — preferably 
both. 

“So, what have you 
got? Need help grading?” 

James couldn't help 
smiling. “In a sense, yes. 
You remember the pro- 
ject I was working on? 
Calculating the cross- 
section of strangelets —” 

“Of course I remem- 
ber — damn shame you 
didn't keep at it.” 

“I didn’t come here to 
argue about the past,” 
James managed evenly. 
He showed Myers the 
drawing and explained 
his observations, then 
watched as his former 
adviser measured it again 
and again before getting 
up and shutting the door 
to his office. 

“Do you realize what this may mean? This 
could be the first experimental evidence for 
my theories.” 

“Our theories,” James reminded him. 

“Yes of course, you'll get credit for finding 
this. But let me keep it for a while — I want 
to see if temperature or magnetic fields have 
an effect on the measurements.” 

There was something James didn’t like in 
Myers’ voice. “That’s OK, I’ll hold onto it for 
now,” he said, picking up the paper. 

“Wait, you don’t realize how important 
this is!” Myers snatched at the drawing and 
pulled. 

There was an audible rip, and the room 
filled with a brilliant emerald light. The 
paper seemed to shrivel in upon itself, 
becoming smaller and smaller until it disap- 
peared altogether with a soft sizzling sound. 
The preternatural light vanished, and the 
two men were left staring at the empty space 
between them. 


Dan Erlanson is a chemist who mostly 
enjoyed graduate school and is still on good 
terms with his PhD adviser. 


MEGF10 and MEGF11 mediate homotypic interactions 
required for mosaic spacing of retinal neurons 


Jeremy N. Kay¹, Monica W. Chu¹ & Joshua R. Sanes¹ 


In many parts of the nervous system, neuronal somata display 
orderly spatial arrangements’. In the retina, neurons of numerous 
individual subtypes form regular arrays called mosaics: they are 
less likely to be near neighbours of the same subtype than would 
occur by chance, resulting in ‘exclusion zones’ that separate 
them’~*. Mosaic arrangements provide a mechanism to distribute 
each cell type evenly across the retina, ensuring that all parts of the 
visual field have access to a full set of processing elements’. 
Remarkably, mosaics are independent of each other: although a 
neuron of one subtype is unlikely to be adjacent to another of 
the same subtype, there is no restriction on its spatial relationship 
to neighbouring neurons of other subtypes’. This independence 
has led to the hypothesis that molecular cues expressed by specific 
subtypes pattern mosaics by mediating homotypic (within-subtype) 
short-range repulsive interactions'*°. So far, however, no molecules 
have been identified that show such activity, so this hypothesis 
remains untested. Here we demonstrate in mouse that two related 
transmembrane proteins, MEGF10 and MEGF11, have critical roles 
in the formation of mosaics by two retinal interneuron subtypes, 
starburst amacrine cells and horizontal cells. MEGF10 and 11 
and their invertebrate relatives Caenorhabditis elegans CED-1 
and Drosophila Draper have hitherto been studied primarily as 
receptors necessary for engulfment of debris following apoptosis 
or axonal injury'’ *. Our results demonstrate that members of this 
gene family can also serve as subtype-specific ligands that pattern 
neuronal arrays. 
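
One common way to put a number on mosaic regularity is the nearest-neighbour regularity index: the mean nearest-neighbour distance divided by its standard deviation. The sketch below illustrates the idea of an exclusion zone and is not necessarily the analysis used in this study. It compares a purely random array of cell positions with an array built under a minimum-spacing rule. The field size, cell count, spacing value and all function names are assumptions chosen for illustration.

```python
import math
import random
import statistics


def nearest_neighbour_distances(points):
    """Distance from each point to its closest neighbour (brute force)."""
    return [
        min(math.dist(p, q) for j, q in enumerate(points) if j != i)
        for i, p in enumerate(points)
    ]


def regularity_index(points):
    """Mean nearest-neighbour distance divided by its standard deviation."""
    distances = nearest_neighbour_distances(points)
    return statistics.mean(distances) / statistics.stdev(distances)


def random_points(n, rng):
    """n positions placed independently and uniformly in a unit square."""
    return [(rng.random(), rng.random()) for _ in range(n)]


def exclusion_zone_points(n, min_dist, rng, max_attempts=100_000):
    """Place points one by one, rejecting any closer than min_dist to an existing point."""
    points = []
    attempts = 0
    while len(points) < n and attempts < max_attempts:
        attempts += 1
        candidate = (rng.random(), rng.random())
        if all(math.dist(candidate, p) >= min_dist for p in points):
            points.append(candidate)
    return points


rng = random.Random(0)  # fixed seed so the comparison is repeatable
random_array = random_points(200, rng)
mosaic_array = exclusion_zone_points(200, 0.04, rng)

random_index = regularity_index(random_array)
mosaic_index = regularity_index(mosaic_array)
print(f"random array regularity index: {random_index:.2f}")
print(f"exclusion-zone array regularity index: {mosaic_index:.2f}")
```

A higher index means more uniform spacing. Under these assumptions the exclusion-zone array scores higher than the random one, which is the signature of a mosaic.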

The retina contains over 70 neuronal subtypes, divided into broad 
categories of photoreceptors, interneurons and retinal ganglion cells’. 
To seek molecules involved in cell-cell recognition events during 
retinal circuit assembly, we purified 13 subtypes of retinal neurons 
from transgenic mice and used microarrays to inventory the genes 
they expressed'*’”. We collected cells at postnatal day (P)6, a time at 
which synapse formation and mosaic refinement are underway. From 
this data set we identified genes selectively expressed by specific sub- 
types, including starburst amacrine cells (SACs), an interneuronal 
subtype that has critical roles in motion detection’®. The approxi- 
mately 100 genes that met our criteria for selective SAC expression 
included most known SAC markers as well as potential new markers 
(Fig. 1a and Supplementary Table 1). Thirty-one of the novel genes 
were tested by in situ hybridization or immunohistochemistry in com- 
bination with SAC markers such as choline acetyltransferase (ChAT) 
or calbindin’’. This secondary screen yielded 26 genes (Supplementary 
Table 1), among which were two homologues, Megf10 and Megf11. 
These genes encode transmembrane proteins with multiple epidermal 
growth factor (EGF)-like domains, a single membrane-spanning 
region, and a cytoplasmic domain with several binding sites for signal 
transduction components'*”’ (Supplementary Fig. 1). At P5-6, both 
genes were strongly expressed in SACs (Fig. 1a–c). Both were also 
expressed in horizontal cells (HCs), which were not part of the data set. 

SACs are present in both the inner nuclear and ganglion cell layers, 
and form independent mosaics in each°. These mosaics develop during 
late embryonic stages, as newborn SACs migrate from the site of their 


birth, the outer neuroblast layer, to their final laminar locations. SACs 
begin to exhibit mosaic spacing upon arrival at their destinations, 
presumably due to contacts with their homotypic neighbours*’. As 
Figure 1 | Expression of Megf10 and Megf11 in SACs and HCs. a, Relative 
expression level of Megf10, Megf11 and the known SAC markers Chat, Calb1 
(encoding calbindin) and Isl1 (islet 1) in 13 amacrine (green), bipolar (blue) and 
retinal ganglion cell (black) subtypes analysed with microarrays. Level in SACs 
set to 1 for each gene. Isl1 was also detected in ON bipolar cells as previously 
reported. Abbreviations for subtypes are defined in Supplementary Table 2. 
b, Schematic of retina. INL, inner nuclear layer with horizontal, bipolar, 
amacrine, and Müller glial cells; IPL, inner plexiform layer with synapses 
among bipolar, amacrine and ganglion cells; GCL, ganglion cell layer including 
displaced amacrine cells; ONL, outer nuclear layer containing photoreceptors; 
OPL, outer plexiform layer with photoreceptor synapses. SACs (red), HCs 
(green) and Müller cells (dark grey row) are indicated. c, In situ hybridization 
for Megf10 (left panel; red in right panel) combined with anti-calbindin 
immunohistochemistry (green in right panel) at P5 shows expression in SACs 
(black arrows) and HCs (arrowheads). d, Megf10 (red) in E16 retina. Islet 1 
(green) marks SACs migrating through the outer neuroblast layer (ONbL) and 
in the INL. Megf10 (red) is expressed by migrating SACs as they arrive in the 
INL (arrowhead), but not at earlier stages of their migration through the ONbL 
(arrows). e, Megf10 expression appears in Müller glia (Mü) and is lost from 
SACs and HCs by P14. f, Double-label immunostaining for MEGF10 (green) 
and ChAT (red). MEGF10 protein localizes to the somata and processes of 
developing SACs (also see Supplementary Fig. 1). Scale bars, 20 µm (c-e) or 
10 µm (f). 
new SACs are added to the array, local cellular rearrangements maintain 
mosaic spacing. Mosaic spacing is maintained even as SAC dendrites 
grow to overlap with those of their neighbours; thus, mosaicism is 
distinct from the phenomenon called tiling, which minimizes dendritic 
overlap. Co-staining at embryonic day (E)16 with the early SAC 
marker islet 1 (refs 9, 21) demonstrated that newborn SACs activated 
Megf10 expression as they finished their migration and became integrated 
into mosaics (Fig. 1d). Megf10 expression persisted in SACs 
from E16 to the first postnatal week, and began in HCs at P0 (Supplementary 
Fig. 1 and data not shown). In the second postnatal week, 
Megf10 was downregulated in neurons but appeared in Müller glia 
(Fig. 1e), consistent with previous reports that Megf10 is expressed 
by brain glia. To determine the subcellular localization of MEGF10 
in neurons, we generated an antibody to recombinant protein (Supplementary 
Fig. 1). MEGF10 was present both on the somata of SACs 
and HCs and on their processes (Fig. 1f and Supplementary Fig. 1). As 
expected, immunoreactivity levels on SACs and HCs were highest 
during the first postnatal week, and then declined (data not shown). 
Thus, Megf10 is expressed by SACs and MEGF10 protein is present on 
SAC processes during the time that mosaics form. Because Megf11 
expression was not observed in retina until after SAC mosaics had 
formed (see later), we focused first on Megf10. 

To investigate whether MEGF10 is required for SAC development, 
we generated mutant mice (Supplementary Fig. 2). Megf10 mutants 
were viable and fertile and their retinas exhibited no gross abnormalities. 
Mutant SACs migrated to the inner nuclear and ganglion cell layers as in 
controls, they were present in normal numbers, and their dendrites 
projected to appropriate sublaminae of the inner plexiform layer 
(Fig. 2a-c). Examination of whole mounts showed, however, a marked 
loss of regular spacing among SAC somata, suggesting that their mosaic 
arrangement had been disrupted (Fig. 2a and Supplementary Fig. 3). 
To assess the degree to which SAC mosaics were disrupted, we 
measured the exclusion zone, the region surrounding each SAC in 
which other SACs are rarely found. This parameter is calculated from 
the density recovery profile, a plot of cell density as a function of 
distance from each SAC in the array (Fig. 2d). The SAC exclusion 
zone in mutants was smaller than in wild-type littermates, and was 
approximately equal to the diameter of a SAC soma (Fig. 2e). As the 
only limitation on proximity was soma size, SACs seem to be positioned 
randomly in Megf10 mutants. This conclusion was supported 
by two additional measurements of spatial order: the packing factor, 
another index of regularity calculated from the density recovery profile; 
and an independent measure, the Voronoi domain regularity 
index, which quantifies variations in the area of the territories nearest 
to each cell in an array. In each case the index calculated for SAC 
arrays in Megf10 mutants was similar to that measured for computer-generated 
random arrays, whereas SAC arrays from wild-type littermates 
were highly ordered (Fig. 2f, g); heterozygotes showed a mild disorganization 
(Supplementary Fig. 4). Together these results suggest that 
MEGF10 acts in SACs to impose a minimal intercellular spacing; in 
Figure 2 | Loss of SAC mosaic spacing in Megf10 mutant mice. a, SAC 
mosaic in inner nuclear layer (INL) of wild-type (left) and Megf10-/- (centre) 
retina, revealed by whole-mount staining with anti-ChAT. Wild-type mice 
have evenly spaced SAC somata, whereas mutants exhibit clumps and gaps 
similar to those seen in a simulation of a random cellular array (right). See 
Supplementary Fig. 3 for similar results in ganglion cell layer (GCL) SACs. 
b, ChAT-stained retinal sections from wild-type and Megf10 mutant animals. 
Laminar positions of SAC somata and processes are normal in mutants, even in 
regions where somata are clumped. c, Density of SACs and VGLUT3+ 
amacrine cells is similar in wild-type (+/+) and Megf10 mutant (-/-) retina. 
d, Density recovery profiles (DRPs) for the SAC (INL) and VGLUT3+ 
amacrine arrays. Graphs show the density of cells in a ring of radius x, relative to 
the density of cells in the image as a whole. Dashed line, DRP of random point 
array. The exclusion zone characteristic of mosaic spacing is measured as a dip 
below this line. e, Exclusion zone radius measured from d. Dashed line, 
expected result for an array of cells distributed randomly; that is, the diameter of 
a single cell. Increases above this minimum indicate spatial order. The mutant 
SAC exclusion zone radius was similar in size to a SAC cell diameter, and was 
significantly smaller than wild type (*P < 0.0001). VGLUT3+ amacrine 
exclusion zones were unaffected. f, g, SAC packing factor (f) and Voronoi 
domain regularity index (g) were significantly lower in Megf10-/- mice than in 
wild-type littermates (*P < 0.0001). Dashed line, mean for arrays of cells 
distributed randomly. Wild-type SACs and VGLUT3+ cells of both genotypes 
were non-randomly arrayed. The SAC array in mutants was not significantly 
different from random arrays (f, P = 0.16; g, P = 0.48). h, Morphology of single 
GCL SACs, labelled with adeno-associated virus driving membrane-targeted 
Cherry fluorescent protein, showed no gross abnormalities in Megf10 mutants 
(n = 8 cells each genotype). Data from P15 (a-g) or P80 (h) mice. Scale bars, 
50 µm. Error bars, s.e.m. 
its absence SACs assume random positions relative to each other. By 
contrast, other amacrine cell types examined (VGLUT3- (also known 
as SLC17A8) and tyrosine hydroxylase-positive) as well as bistratified 
direction-selective retinal ganglion cells, which are prominent synaptic 
targets of SACs, showed normal mosaic spacing (Fig. 2d-g and 
Supplementary Fig. 3). 

In mice lacking the adhesion molecules DSCAM or DSCAML1, 
certain retinal cell mosaics form normally, then degrade secondary 
to hyperfasciculation of their neurites. To investigate whether 
MEGF10 acts in a similar indirect fashion, we labelled individual 
SACs in mutants. Lack of MEGF10 had no obvious effect on SAC 
dendritic morphology (Fig. 2h). Moreover, soma disorganization 
was already evident by P0 (Supplementary Fig. 5), shortly after mosaics 
form, indicating that MEGF10 affected formation of SAC mosaics. 

Next we asked whether MEGF10 is also essential for formation of 
the HC mosaic. Loss of MEGF10 had only a modest effect on HC 
regularity (Fig. 3). We therefore considered that MEGF11 might have 
a redundant role. Megf11 was not expressed in embryonic retina, but it 
appeared in HCs and SACs during the first postnatal week and persisted 
into adulthood (Fig. 3a-d). Importantly, SAC mosaics begin to 
form before P0, but HC mosaics are established postnatally. We 
therefore generated Megf11 mutant mice (Supplementary Fig. 2), 
which, like Megf10 mutants, showed no gross retinal abnormalities 
(Supplementary Fig. 6). Examination of HC arrays in Megf11 mutants 
revealed a modest decrease in regularity similar to that in Megf10 
mutants, whereas in Megf10-/-; Megf11-/- double-mutant animals, 
the HC mosaic was severely disrupted (Fig. 3e-m and Supplementary 
Fig. 7). By contrast, SACs were unaffected by loss of Megf11 
function, and were no more affected in double mutants than in 
Megf10 single mutants (Supplementary Figs 6 and 8). Thus, Megf11 
is dispensable for SAC arrangement, but acts together with Megf10 to 
shape the HC mosaic. 

To elucidate the cellular mechanisms by which Megf10 and Megf11 
act, we used a gain-of-function approach in which we introduced 
MEGF10 into the retina by electroporation of plasmid DNA at P0. 
We first tested the hypothesis that MEGF10 can act as a signal that 
repels SACs, creating the exclusion zone that defines mosaic spacing. 
The electroporation method predominantly transfects dividing cells, 
leading to expression in neurons that exit the cell cycle postnatally, 
such as bipolar cells, Müller glia, photoreceptors and late-born amacrine 
cells. Because SACs and HCs are born embryonically, they are rarely 
transduced, allowing us to surround wild-type SACs and HCs with 
cells ectopically expressing excess MEGF10. Indeed, electroporation of 
plasmid encoding a fluorescent protein (FP) or a MEGF10-FP fusion 
produced retinal patches in which a large fraction of neurons, but no 
SACs or HCs, were FP+ (Fig. 4a and Supplementary Fig. 9). Expression 
of MEGF10-FP (but not FP alone) resulted in exclusion of SACs and 
HCs from a swath at the edge of the electroporated patch, whereas 
spacing of these cells was essentially normal in patch centres (Fig. 4a, 
b and Supplementary Fig. 10). A truncated MEGF10 lacking the 
cytoplasmic domain produced an identical phenotype (Supplemen- 
tary Fig. 10), ruling out the possibility that MEGF10 acts indirectly 
by triggering production of a repellent factor by the transfected cells. 
Thus, MEGF10 can act as a ligand that signals to SACs and HCs. 

Because MEGF10 relatives have been implicated in cell engulfment, 
we asked whether its overexpression formed an exclusion zone by 
eliminating cells. We found no evidence for cell death at patch edges 
(no pyknotic nuclei or activated caspase-3 immunoreactivity; data not 
shown). Instead, cell density was increased at the outer edge of the 
cell-free swath, indicating that SACs had exited the patch to create the 
Figure 3 | HC mosaic spacing requires Megf10 and Megf11. a-d, In situ 
hybridization for Megf11 at ages indicated. Megf11 (red) was not expressed at 
E16 (a). Calbindin immunostaining (green) labels SACs and HCs, which co-express 
Megf11 at P5 (b), P7 (c) and P14 (d). See Fig. 1 for abbreviations. 
e-i, Retinal whole mounts stained for calbindin to reveal the HC array. In wild-type 
mice (e), HCs are distributed evenly. Megf10-/- mutants (f) and Megf11-/- 
mutants (g) show subtle changes in the regular spacing of HCs, whereas double 
Megf10-/-; Megf11-/- mutants (h) show marked HC disorganization similar to a 
simulation of a random HC array (i). j-m, Quantification of HC spacing 
regularity in Megf10-/- (red) or Megf11-/- (green) single mutants; Megf10-/-; 
Megf11-/- double mutants (blue); and wild-type (WT) siblings (grey). In all 
genotypes, HCs were present at normal density (j) but were less regularly 
spaced relative to wild type based on exclusion zone radius (k), packing factor 
(l) and Voronoi domain regularity (m) measurements as in Fig. 2. Double 
mutants showed significantly less order than single Megf10-/- or Megf11-/- 
mutants and approach random arrangement, indicated by dashed lines 
(k, mean HC soma diameter; l, m, computed values for random arrays). P 
values: *P < 0.01, **P < 0.001, ***P < 0.0001. NS, not significant. Error bars 
give s.e.m. Data in e-m from P15 animals. Scale bars, 20 µm (a-d; b, c share 
scale) or 50 µm (e-i). 
Figure 4 | MEGF10 acts as both ligand and receptor to trigger SAC 
repulsion. a, A retinal patch transfected by electroporation with plasmid 
encoding MEGF10-FP fusion protein, viewed in flat mount. SAC somata in 
INL, stained with anti-ChAT, are excluded from a swath at the patch edge. 
SACs are evenly spaced elsewhere, except where the retina was pierced to inject 
DNA (Inj. site). b, Higher-magnification views of patch edges. FP 
misexpression (left) did not affect SAC spacing, but MEGF10-FP (right) 
produced a SAC-free zone just inside the transfected region and induced 
apparent crowding of SACs immediately outside it (arrows). Dashed line, patch 
edge. See Supplementary Fig. 11 for quantification of cell distribution at patch 
edges. c, Hypothesis for MEGF10 function based on a, b. In wild-type retina 


SAC-free zone (Fig. 4b and Supplementary Fig. 11). Our interpretation 
is that SACs and HCs are repelled by MEGF10, but that within 
MEGF10-FP patches the concentration of MEGF10 is similar in all 
directions, resulting in no net movement (Fig. 4c). Consistent with this 
view, SACs and HCs were entirely excluded from small patches (≤5-6 
cell diameters; Supplementary Fig. 10). Together these results suggest 
that MEGF10 can act as a repellent ligand. 

We next asked which cells are sensitive to MEGF10 ligand. In 
Megf10 mutants, SACs and HCs are selectively affected; this might 
be because they are uniquely sensitive to MEGF10 or because only 
these cells encounter endogenous MEGF10 at high concentration. 
Ectopic expression allowed us to distinguish between these possibilities. 
None of 13 other amacrine, bipolar or retinal ganglion cell subtypes 
assayed with cell-type-specific markers was detectably affected by 
MEGF10-FP (Fig. 4d and Supplementary Table 3). Thus, MEGF10 
appears to act as a cell-surface ligand for a receptor specifically expressed 
by SACs and HCs. 

This led us to ask whether MEGF10 might be the MEGF10 receptor. 
We used cultured epithelial cells (HEK293) to seek evidence for a 
MEGF10-dependent homotypic interaction. Whereas FP-expressing 
cells overlapped, MEGF10-FP-expressing cells formed sharp borders 
with narrow gaps (Supplementary Fig. 12a), consistent with previous 
results. This ‘jigsaw’ pattern reflected suppression or elimination of 
filopodia at points of intercellular apposition (Supplementary Fig. 12b). 
Jigsaw formation and loss of filopodia required the MEGF10 cytoplasmic 
domain and did not occur when a MEGF10-FP-expressing cell con- 
tacted an untransfected or FP-transfected cell (Supplementary Fig. 
12b, c). Although we have been unable to demonstrate MEGF10 
homophilic binding using biochemical methods (data not shown), 
these results suggest that a MEGF10-containing signalling complex 
mediates a homotypic interaction resulting in intercellular repulsion. 

Finally, we tested the idea that MEGF10 serves as both ligand and 
receptor in vivo. To this end, we electroporated MEGF10 into Megf10 
mutant retina. Mutant SACs did not exit MEGF10-FP patches (Fig. 4e 
(top), SACs use MEGF10 as a ligand to signal their location to neighbouring 
SACs. Cell 3 positions itself at the point where repulsive signals on either side 
(from Cell 1 and Cell 2) are equal. In MEGF10-FP overexpression (bottom), 
that location is now outside the patch. d, MEGF10-FP transfected whole-mount 
immunostained for ChAT and VGLUT3. Dashed lines mark SAC-free 
zone (see Supplementary Fig. 10). VGLUT3+ amacrine cells are present in this 
zone. e, MEGF10-FP is incapable of generating a SAC-free zone when 
misexpressed in Megf10 mutant retina, indicating that MEGF10 is needed for 
SAC responses to MEGF10. See Supplementary Fig. 11 for quantification. Scale 
bars, 100 µm (a); 50 µm (b-e). All retinas are from P10-P15 animals. 


and Supplementary Fig. 11b), indicating that Megf10 gene function is 
required for SACs to respond to MEGF10 repulsive signals. Together 
these results suggest a model in which SACs and HCs use MEGF10 as 
part of a receptor complex that detects MEGF10 on their homotypic 
neighbours. This repulsive signal positions their somata so as to equalize 
MEGF10 signals on all sides, thereby creating exclusion zones (Fig. 4c). 

The phenomenon of retinal mosaicism implies a molecular system 
for cell-type-specific recognition. Several potential mechanisms have 
been proposed, based on imaging and computational studies, one being 
that short-range repulsive signals regulate tangential movements to 
establish each cell’s territory. So far, however, direct mediators of 
this phenomenon have not been described. Here we provide evidence 
that, for SACs and HCs, mosaic spacing requires repellent homotypic 
interactions mediated by MEGF10 and 11. Signals initiated by 
MEGF10/11 in growing neurites could lead to repositioning of the 
soma within the cytoplasm, perhaps by mechanisms resembling those 
that direct soma translocation in response to signals at the leading 
process of migrating neurons. The finding that MEGF10 and 11 
facilitate formation of three independent mosaics (two for SACs and 
one for HCs) appears at first to contradict the idea that mosaics are 
independent of each other and must therefore be regulated by distinct 
molecules. However, because the three mosaics occupy distinct 
cellular planes, they may be exempt from the requirement for molecular 
individuation. Different molecules are likely to mediate homotypic 
interactions in other retinal subtypes, some of which may lead to soma 
translocation and others to the death of cells that violate minimal 
spacing. 

We note two broader implications of our results. First, Draper/CED-1/MEGF10 
homologues have until now been studied predominantly as 
receptors for cell engulfment. Here we show that they also mediate 
cell-cell repulsion and can act as ligands as well as receptors, thereby 
expanding the roles for this gene family. Second, although mosaic 
arrangements have so far been studied formally only in retina, regularly 
arranged neuronal arrays are common features of central neural 
organization. Mechanisms similar to those described here could be 
involved in promoting this regularity. 

©2012 Macmillan Publishers Limited. All rights reserved 


METHODS SUMMARY 


Retinal neurons expressing fluorescent proteins were purified and used to generate 
amplified RNA for hybridizing Affymetrix microarrays as described previously. 
SAC-specific genes were identified using dChip software. Megf10 and 
Megf11 mutant mice were produced from constructs generated by the Knockout 
Mouse Project and European Conditional Mouse Mutagenesis program 
(EUCOMM). Histological methods and methods for electroporation of 
plasmid DNA in vivo were described previously. HEK293 cells (ATCC) were 
cultured and transfected by standard methods. 

For analysis of spatial statistics, we sampled a 635.9 µm square at 3-4 locations 
per retina. X-Y cell coordinates, marked manually, were used to calculate Voronoi 
domain areas (Fiji) or DRP statistics such as the effective radius (that is, exclusion 
zone) and packing factor (WinDRP). We generated random arrays matched in 
density and soma size to real data; these were analysed in parallel with data from 
mutants. For measurement of SAC crowding in gain-of-function experiments, the 
Delaunay triangulation was used to define each cell’s nearest neighbours. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Animals. CD1 and C57BL/6 mice were obtained from Charles River and Jackson 
Labs. All experiments were carried out in accordance with protocols approved by 
the Harvard University Standing Committee on the Use of Animals in Research 
and Teaching. 

Megf10 mutant mice were produced from a construct provided by the Knockout 
Mouse Project (CHORI). Embryonic stem (ES) cell electroporation and 
chimaera production were performed by the Harvard Genome Modification 
Facility. ES cell clones were screened for integration by PCR using primers listed 
in Supplementary Table 4 (also see Supplementary Fig. 2). Megf11 mutant mice 
were generated from gene-targeted ES cells provided by the European Conditional 
Mouse Mutagenesis program (EUCOMM). In both cases germline transmission 
was obtained from two chimaeric mice generated from independent ES cell clones; 
each had indistinguishable phenotypes. The primers used for genotyping Megf10 
and Megf11 mice are listed in Supplementary Table 4. Drd4-GFP mice were 
obtained from the Mutant Mouse Regional Resource Center. 

Cell sorting and expression profiling. Retinal neurons expressing fluorescent 
proteins were purified as described. Briefly, P6 retinas were dissociated, live-stained 
with antibodies recognizing cell surface antigens (if required for purification), 
and then passed through a flow cytometer (MoFlo; Dako). Positive cells 
were then either: (1) plated, fixed, stained with cell-type-specific markers and 
counted to assess purity; or (2) sorted directly into RNA lysis buffer (PicoPure 
Kit; MDS). Gene expression was profiled using Affymetrix Mouse 430 2.0 arrays, 
following two rounds of linear amplification (MessageAmp II; Applied 
Biosystems). Using these methods we generated a gene-expression database for 
13 specific retinal neuron subtypes (five amacrine, two bipolar and six retinal 
ganglion cell types; see Supplementary Table 2). SAC-specific genes (Supplementary 
Table 1) were identified with dChip software, using fold-change over other 
cell types as the filtering criterion. 

Histology and staining. Fixation and preparation of retinal cryosections were 
performed as described. For whole-mount retinal stains, the retina was dissected 
out of the eyecup in a dish of phosphate buffered saline (PBS), transferred to an 
Eppendorf tube (up to four retinas per tube), and incubated at room temperature 
(20-25 °C) for 1h with agitation in blocking solution (PBS + 0.3% Triton-X-100 
+ 3% donkey serum; Jackson Immunoresearch). Primary antibody, diluted in 
blocking solution, was then applied to the sample; incubation was for 6 days at 
4°C with agitation. Following a 2-3h wash in 2-3 changes of PBS, secondary 
antibodies (Jackson Immunoresearch) were applied overnight at 4 °C with agita- 
tion. After at least 4h of washing in PBS with agitation, retinas were flattened onto 
nitrocellulose membranes (Millipore) and mounted on slides (Fluoromount G; 
Southern Biotech). Images were acquired with a FV1000 confocal microscope 
(Olympus). 

The following antibodies were used for immunostaining: rabbit anti-calbindin 
(Swant); goat anti-ChAT (Millipore); goat anti-VAChT (Promega); guinea pig 
anti-VGLUT3 (Millipore); mouse anti-islet 1/2 (clone 39.4D5, Developmental 
Studies Hybridoma Bank (DSHB)); mouse anti-SYT2 (DSHB); chicken anti-green 
fluorescent protein (GFP; Aves Labs); rabbit anti-DsRed (Clontech); rabbit anti-β-galactosidase; 
mouse anti-glutamine synthetase (BD); sheep anti-tyrosine 
hydroxylase (Millipore); mouse anti-PKC (Abcam); rabbit anti-DAB1 (Millipore); 
rabbit anti-CART (Phoenix); mouse anti-Kv4.2 (Neuromabs); rabbit anti-SOX9 
(Millipore); rabbit anti-cleaved-caspase 3 (Cell Signaling Technology); rabbit 
anti-bNOS (Sigma); rabbit anti-EBF3 (ref. 17); rabbit anti-GAD65/67 (Millipore); 
goat anti-GLYT1 (Millipore); mouse anti-BRN3A (Millipore); mouse SMI-32 
(Sternberger). MEGF10 antibody was raised in rabbits against a His-tagged peptide 
corresponding to amino acids 879-1130 of mouse MEGF10 (Millipore). 

In situ hybridization was performed according to previously reported methods. 
To make the template for the Megf10 antisense probe, we obtained an IMAGE clone 
(BC075647) containing full-length mouse Megf10 and linearized with ClaI. Megf11 
template, in pGEM-T Easy (Promega), was generated by PCR from brain cDNA 
using primers TGTCTTCCTTCTGCAAGCTGCTCT and ATTCCACAAGTGCCTGGTGAGTGT. 
Antisense riboprobes were transcribed using the DIG 
RNA labelling kit (Roche). Calbindin immunostaining was used to label SACs 
and HCs; at P0-P7 it is a selective marker for these two cell types, showing 
additional expression in only a small subset of amacrine and retinal ganglion cells 
(Fig. 1 and Supplementary Fig. 9, and data not shown). 

Labelling of single SACs and HCs was performed by intraocular injection of 
low-titre adeno-associated virus, as described. 

Quantification of mosaic regularity. For analysis of mutant phenotypes, confocal 
z-stacks through the GCL and INL were acquired from P14-17 retinas stained with 
antibodies to ChAT, calbindin, tyrosine hydroxylase, and/or VGLUT3. Each retina 
was sampled in 3-4 locations (sample size was a square, 635.9 µm on each side). 
Stacks in which mounting artefacts introduced large local z-axis displacement of the 
SAC or HC array were not used. We sampled from both central and peripheral 
retina; despite differences in cell density between these locations the spatial 
organization of SAC, HC and VGLUT3 arrays did not appear to differ systematically. 
We therefore pooled data from central and peripheral retina for subsequent analysis. 
Using Fiji software, the centre of each cell was marked manually to generate X-Y 
coordinates. The point array was flattened to a single plane without correction for 
local curvature, but as we only used very flat images for analysis, the spatial distortions 
introduced by this procedure were minor. Density recovery profiles 
(DRPs) were computed from these X-Y coordinates using WinDRP software. 
Voronoi domain areas were computed in Fiji and Microsoft Excel from the X-Y 
coordinate data. SAC, HC and VGLUT3+ amacrine cell density and cell diameters 
were determined from these same images using Fiji to perform measurements. 
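
As an illustration of what a density recovery profile measures, the following minimal Python sketch bins the distances from each reference cell to every other cell into concentric annuli and converts the counts to densities. It is not the WinDRP implementation: the function name, the parameters, and the simple border rule (reference cells closer than the largest radius to an image edge are skipped, with no edge correction) are assumptions made for this example.

```python
import math


def density_recovery_profile(points, width, height, bin_width, n_bins):
    """Return (densities, mean_density) for a list of (x, y) cell centres.

    densities[k] is the mean cell density in the annulus between
    k * bin_width and (k + 1) * bin_width around the reference cells.
    """
    r_max = bin_width * n_bins
    # Assumption: only cells at least r_max from every border serve as
    # reference cells, so each annulus lies fully inside the field.
    refs = [i for i, (x, y) in enumerate(points)
            if r_max <= x <= width - r_max and r_max <= y <= height - r_max]
    if not refs:
        raise ValueError("field too small for the requested radius")

    counts = [0] * n_bins
    for i in refs:
        xi, yi = points[i]
        for j, (xj, yj) in enumerate(points):
            if j == i:
                continue
            d = math.hypot(xj - xi, yj - yi)
            if d < r_max:
                counts[min(int(d // bin_width), n_bins - 1)] += 1

    densities = []
    for k in range(n_bins):
        annulus_area = math.pi * ((k + 1) ** 2 - k ** 2) * bin_width ** 2
        densities.append(counts[k] / (len(refs) * annulus_area))

    mean_density = len(points) / (width * height)
    return densities, mean_density
```

For a regularly spaced array, the innermost bins fall well below the mean density; that dip is the exclusion zone.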

We calculated three measures of regularity for each image. First, from the DRP, 
we obtained the effective radius. Effective radius gives the size of the exclusion 
zone—the zone in which another cell is less likely to be found than would be 
expected for a random array. Second (also from the DRP), we computed the 
packing factor—a regularity index that ranges between 0 (for a random array) 
and 1 (for a perfect hexagonal array). Third, we calculated the Voronoi domain 
regularity index (VDRI) by dividing the mean Voronoi domain area for a given 
cellular array by the standard deviation of those areas. 
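
The three indices can be sketched in Python as follows. The VDRI follows the definition given above (mean Voronoi domain area divided by the standard deviation of the areas); the text does not say whether a sample or population standard deviation was used, so the sample form is assumed. The effective radius and packing factor follow one common formulation of the DRP statistics and are assumptions here, not a transcription of WinDRP: the dip is summed over the leading bins that lie below the mean density, and the packing factor normalises by the spacing of a hexagonal lattice of equal density. All function names are hypothetical.

```python
import math
import statistics


def effective_radius(densities, bin_width, mean_density):
    """Radius of a disc whose missing cells equal the DRP dip near the origin.

    Assumed formulation: sum (mean_density - density) * annulus_area over the
    leading bins below mean_density, then solve
    pi * r**2 * mean_density = deficit for r.
    """
    deficit = 0.0
    for k, d in enumerate(densities):
        if d >= mean_density:
            break  # first bin that recovers to the mean ends the dip
        annulus_area = math.pi * ((k + 1) ** 2 - k ** 2) * bin_width ** 2
        deficit += (mean_density - d) * annulus_area
    return math.sqrt(deficit / (math.pi * mean_density))


def packing_factor(r_eff, mean_density):
    """(r_eff / r_hex)**2, where r_hex is the nearest-neighbour spacing of a
    hexagonal lattice with the same density (assumed normalisation)."""
    r_hex = math.sqrt(2.0 / (math.sqrt(3.0) * mean_density))
    return (r_eff / r_hex) ** 2


def vdri(voronoi_areas):
    """Voronoi domain regularity index: mean area / standard deviation.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    return statistics.mean(voronoi_areas) / statistics.stdev(voronoi_areas)
```

Under these definitions a larger dip in the DRP gives a larger effective radius, and more uniform Voronoi domain areas give a higher VDRI.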

To ascertain how the measured packing factor and VDRI compared to those 
that would be observed in cell arrays lacking mosaic spacing, we generated 
random simulations of HC, SAC, and VGLUT3+ amacrine cell arrays. These 
simulations placed cells randomly in a 635.9 µm square following a Poisson point 
process, until the density of the array equalled the mean density of the cell type in 
question. The only constraint on cell location in these simulations was soma 
diameter, which was calculated for each cell type as described earlier. The soma 
diameters used to constrain the simulations were 9.0 µm (VGLUT3+ amacrine 
cells), 11.0 µm (SACs) and 12.2 µm (HCs). Programita software was used to 
generate the simulations (n = 10 for each cell type). Packing factor and VDRI 
were then calculated as described above, and the means were plotted as dashed 
lines in Figs 2 and 3. The VDRIs for our random simulations were similar to those 
calculated previously. 
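
The simulation procedure described above can be sketched in a few lines of Python. This is not the Programita implementation; it is a simple sequential placement with rejection, in which candidate positions are drawn uniformly in the square and discarded if they fall within one soma diameter of an already placed cell. The function name, the seed and retry arguments, and the cell count in the usage note are illustrative assumptions.

```python
import math
import random


def random_array(n_cells, side, soma_diameter, seed=None, max_tries=1_000_000):
    """Place n_cells centres at random in a side x side square.

    The only constraint is that no two centres lie closer together than
    soma_diameter, mimicking cells that cannot overlap.
    """
    rng = random.Random(seed)
    cells = []
    tries = 0
    while len(cells) < n_cells:
        tries += 1
        if tries > max_tries:
            raise RuntimeError("density too high for this soma diameter")
        x, y = rng.uniform(0, side), rng.uniform(0, side)
        # Reject the candidate if it would overlap an existing soma.
        if all(math.hypot(x - cx, y - cy) >= soma_diameter for cx, cy in cells):
            cells.append((x, y))
    return cells
```

For example, `random_array(200, 635.9, 11.0, seed=1)` returns 200 centres in a 635.9 µm square with an 11.0 µm minimum spacing. The count of 200 is arbitrary and is not a measured SAC density.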

For analysis of SAC cell position in MEGF10-misexpressing retina, we acquired 
images at the edge of FP+ or MEGF10-FP+ misexpressing patches, and used the 
above method to obtain X-Y coordinates of all SACs in the field of view. SACs 
inside the patch, outside the patch, and at the edge of the patch were marked 
separately. ‘Edge’ cells were defined as those outside the FP+ region for which the 
shortest line drawn from that cell to the edge of the FP+ region did not pass the 
soma of another SAC. To ask whether SACs were present at higher density at patch 
edges, we calculated the distance from each SAC to its nearest neighbours. 
Neighbours were defined in an unbiased manner by computing the Delaunay 
triangulation for the X-Y location data set (Fiji), thereby defining line segments 
from each cell to its nearest neighbours. 
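
To make the neighbour definition concrete, here is a small brute-force Python sketch: three cells form a Delaunay triangle when no other cell lies inside their circumcircle, and two cells are neighbours when they share a triangle edge. The analysis above used Fiji; this code is only an illustration, its function names are hypothetical, and its O(n^4) cost makes it suitable for small fields only (a library routine such as `scipy.spatial.Delaunay` would be the practical choice).

```python
from itertools import combinations
import math


def _orientation(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def _inside_circumcircle(a, b, c, p, eps=1e-9):
    """True if p lies strictly inside the circle through a, b, c
    (a, b, c must be in counter-clockwise order)."""
    ax, ay = a[0] - p[0], a[1] - p[1]
    bx, by = b[0] - p[0], b[1] - p[1]
    cx, cy = c[0] - p[0], c[1] - p[1]
    det = ((ax * ax + ay * ay) * (bx * cy - cx * by)
           - (bx * bx + by * by) * (ax * cy - cx * ay)
           + (cx * cx + cy * cy) * (ax * by - bx * ay))
    return det > eps


def delaunay_neighbours(points):
    """Map each point index to the set of indices it shares a Delaunay edge with."""
    n = len(points)
    neighbours = {i: set() for i in range(n)}
    for i, j, k in combinations(range(n), 3):
        a, b, c = points[i], points[j], points[k]
        orient = _orientation(a, b, c)
        if abs(orient) < 1e-12:
            continue  # collinear triple, no circumcircle
        if orient < 0:
            b, c = c, b  # enforce counter-clockwise order
        others = (points[m] for m in range(n) if m not in (i, j, k))
        if not any(_inside_circumcircle(a, b, c, p) for p in others):
            for u, v in ((i, j), (j, k), (i, k)):
                neighbours[u].add(v)
                neighbours[v].add(u)
    return neighbours


def neighbour_distances(points, neighbours, i):
    """Sorted distances from cell i to each of its Delaunay neighbours."""
    xi, yi = points[i]
    return sorted(math.hypot(points[j][0] - xi, points[j][1] - yi)
                  for j in neighbours[i])
```

Shorter neighbour distances among the cells at a patch edge than among cells elsewhere would indicate crowding.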

Statistical analysis. For analysis of exclusion zone radius, packing factor, VDRI 
and cell density, the significance of measured differences between genotypes was 
evaluated by the Mann-Whitney U test. Sample sizes were ≥9 images from ≥3 
retinas per genotype (SAC and HC analysis) or 6 images from 2 retinas per genotype 
(VGLUT3 analysis). In double-mutant experiments on HCs, the Holm-Bonferroni 
correction for multiple comparisons (four different genotypes) was applied when 
determining significance level. For measurement of soma diameter, sample size 
was ≥150 cells for each genotype. 
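
For readers unfamiliar with the two procedures, the sketch below computes the Mann-Whitney U statistic by direct pair counting and applies the Holm-Bonferroni step-down rule to a list of P values. It is illustrative only: the function names and the default alpha of 0.05 are assumptions, and no P value is derived from U here (a statistics package would normally be used for that).

```python
def mann_whitney_u(sample_a, sample_b):
    """U statistic for sample_a: number of (a, b) pairs with a > b,
    counting ties as one half."""
    return sum((a > b) + 0.5 * (a == b) for a in sample_a for b in sample_b)


def holm_bonferroni(p_values, alpha=0.05):
    """Return one boolean per P value: True where the null is rejected.

    P values are tested from smallest to largest against alpha / (m - rank);
    testing stops at the first P value that fails its threshold.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        if p_values[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break
    return reject
```

Because the threshold relaxes at each step, the Holm-Bonferroni procedure is never less powerful than a plain Bonferroni correction while still controlling the family-wise error rate.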

Gain-of-function experiments. A Megf10 expression construct was generated by 
PCR amplifying the open reading frame (ORF), with stop codon deleted, from the 
IMAGE clone described earlier. The open reading frame, which was predicted to 
encode a protein equivalent to the Megf10 RefSeq sequence (NP_115822.1), was 
TA-cloned into the Gateway entry vector pCR8GW-Topo (Invitrogen). Primers 
used for cloning were CGATTGTTCTTCACAGAACATGGCG and TTGATGTGATTCACTGCTGCT. 
A cytoplasmic domain-deletion construct was made 
by amplifying with reverse primer TGATTCCTTCCTCTTCTGCTT to generate a 
truncated protein carrying only the first nine amino acids of the intracellular 
domain. For expression, these constructs were transferred to a Gateway destina- 
tion vector bearing the ubiquitin-C promoter and an in-frame C-terminal GFP or 
monomeric Cherry tag. Mouse Megf11 was also cloned, but pilot experiments in 
HEK cells suggested a lack of surface expression, so we did not attempt in vivo 
experiments with mouse Megf11. 

In vivo electroporation was performed as described. Briefly, plasmid DNA 
(at least 1.5 mg ml⁻¹) was injected into the subretinal space of neonatal mice (4-36 h 
postpartum), and current pulses (80 V) were applied across the head using 
paddle electrodes. We obtained identical results in MEGF10-GFP and MEGF10-Cherry 
misexpression experiments. Sample sizes were >20 animals each for 
MEGF10 misexpression and FP controls. 

HEK 293 cells were cultured in DMEM with 10% fetal calf serum and trans- 
fected using TransIT reagent (Mirus). Cells were counterstained using Alexa 
Adaptive radiation of multituberculate mammals 
before the extinction of dinosaurs 


Gregory P. Wilson, Alistair R. Evans, Ian J. Corfe, Peter D. Smits, Mikael Fortelius & Jukka Jernvall 


The Cretaceous-Paleogene mass extinction approximately 66 million 
years ago is conventionally thought to have been a turning point in 
mammalian evolution’. Prior to that event and for the first two- 
thirds of their evolutionary history, mammals were mostly confined 
to roles as generalized, small-bodied, nocturnal insectivores’, 
presumably under selection pressures from dinosaurs*. Release 
from these pressures, by extinction of non-avian dinosaurs at the 
Cretaceous-Paleogene boundary, triggered ecological diversification 
of mammals'”. Although recent individual fossil discoveries have 
shown that some mammalian lineages diversified ecologically during 
the Mesozoic era’, comprehensive ecological analyses of mammalian 
groups crossing the Cretaceous—Paleogene boundary are lacking. 
Such analyses are needed because diversification analyses of living 
taxa°’ allow only indirect inferences of past ecosystems. Here we show 
that in arguably the most evolutionarily successful clade of Mesozoic 
mammals, the Multituberculata, an adaptive radiation began at least 
20 million years before the extinction of non-avian dinosaurs and 
continued across the Cretaceous-Paleogene boundary. Disparity in 
dental complexity, which relates to the range of diets, rose sharply in 
step with generic richness and disparity in body size. Moreover, 
maximum dental complexity and body size demonstrate an adaptive 
shift towards increased herbivory. This dietary expansion tracked 
the ecological rise of angiosperms® and suggests that the resources 
that were available to multituberculates were relatively unaffected 
by the Cretaceous—Paleogene mass extinction. Taken together, our 
results indicate that mammals were able to take advantage of new 
ecological opportunities in the Mesozoic and that at least some of 
these opportunities persisted through the Cretaceous-Paleogene 
mass extinction. Similar broad-scale ecomorphological inventories 
of other radiations may help to constrain the possible causes of mass 
extinctions”. 

Multituberculate mammals were a taxonomically rich*"’ and 
numerically abundant’’ clade that had originated by the Middle 
Jurassic epoch (approximately 165 million years (Myr) ago) and went 
extinct in the late Eocene (approximately 35 Myr ago)*''. They were 
nearly globally distributed’ and had a distinctive dentition consisting 
of procumbent incisors, blade-like premolars, molars with longitudinal 
rows of cusps (Fig. 1) and a predominantly posteriorly directed (palinal) 
chewing motion. 

Palaeontologists have agreed for a long time that the success of 
multituberculate mammals was at least partly related to their highly 
derived dentition. Despite this, there is little consensus on the inter- 
pretations of their feeding ecology, perhaps owing to the limitations of 
previous approaches. For example, toothwear analysis is time intensive 
and requires high-quality preservation, and multituberculates lack 
living descendants, which hampers comparative studies. Interpretations 
of multituberculate feeding ecology therefore vary widely; 
they have been proposed to be broad herbivores, frugivores, granivores, 
root- and bark-eaters, egg-eaters, insectivores, carnivores and 
omnivores. 

Figure 1 | Dental and dietary diversity in multituberculate mammals. 
a-d, Three-dimensional buccal-occlusal and occlusal reconstructions of 
multituberculate lower-right cheek tooth rows for GIS analysis: Late Jurassic 
plagiaulacid Ctenacodon serratus (a), Paleocene ptilodontoid Ptilodus kummae (b), 
Late Cretaceous cimolomyid Cimolomys gracilis (c), Paleocene taeniolabidid 
Taeniolabis taoensis (d). Surface orientation map of each three-dimensional 
reconstruction (colour wheel indicates orientation) for OPC measurements 
(shown by the number of coloured patches). Clumps that are smaller than three 
grid points (black) are ignored. p, premolar; m, molar. Scale bars, 1 mm. 

To obtain a robust and comprehensive view of multituberculate 
ecomorphological diversity through time, we quantified dental com- 
plexity in 41 genera using geographic information systems (GIS) 
analyses'* of three-dimensional crown surfaces of lower cheek teeth 
(Fig. 1; Supplementary Table 1). These analyses do not require cusp 
and facet homologies to be established, which can be a challenging task 
when comparing morphologically and phylogenetically divergent taxa. 
Orientation patch count (OPC), a measure of dental complexity, was 
calculated as the number of discrete surfaces on the cheek tooth row 
distinguished by differences in orientation (for example, north, 
southwest; Fig. 1). Extant rodents, carnivorans and bats demonstrate 
a robust correlation between OPC and feeding ecology; OPC increases 
across the dietary spectrum from carnivores to omnivores to herbivores, 
despite many differences in specific tooth components, body size 
and chewing mechanics among these taxa’*’’. For a given clade, the 
standard deviation of OPC is an effective proxy for the dietary diver- 
sification and divergence in feeding function: higher standard devi- 
ation means greater dental disparity and a broader range of diets. OPC 
thus offers promise as a powerful tool for quantifying overall tooth 
shape and inferring diet in extinct taxa, such as multituberculates, that 
have highly derived dentitions with uncertain homology with living 
mammals and imprecise functional analogy. 
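
The per-bin mean and disparity described above can be tabulated along the following lines. This is a minimal sketch only: the bin labels and OPC values are hypothetical placeholders, and the use of the sample standard deviation is an assumption, because the text does not state which estimator was applied.

```python
from statistics import mean, stdev


def opc_summary_by_bin(records):
    """Return {bin_label: (mean_opc, sd_opc)} from (bin_label, opc) records.

    sd_opc is None when a bin holds fewer than two genera, because the
    sample standard deviation is undefined for a single value.
    """
    by_bin = {}
    for bin_label, opc in records:
        by_bin.setdefault(bin_label, []).append(opc)
    summary = {}
    for bin_label, values in by_bin.items():
        sd = stdev(values) if len(values) > 1 else None
        summary[bin_label] = (mean(values), sd)
    return summary


# Hypothetical example: two genera in one bin, three in another.
records = [
    ("bin_A", 100), ("bin_A", 120),
    ("bin_B", 70), ("bin_B", 150), ("bin_B", 230),
]
summary = opc_summary_by_bin(records)
```

In this toy input the second bin has a higher standard deviation than the first, which under the reasoning above would be read as greater dental disparity and a broader range of diets.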

Among the earliest multituberculates, the ‘Plagiaulacida’ are a 
paraphyletic assemblage of taxa with up to four simple blade-like 
premolars and two multi-cusped molars (Fig. 1a). OPC analyses of 
‘plagiaulacids’ from the Late Jurassic through to the Early Cretaceous 
epoch (from 156-100 Myr ago) show low and tightly constrained 
dental complexity (Fig. 2a; OPC, 84-125). Their OPC values corre- 
spond to carnivory and the low end of animal-dominated omnivory 
among extant mammals (for example, eating both insects and fruits). 
Multituberculates in the early Late Cretaceous (100-84 Myr ago), 
which include mostly basal members of the suborder Cimolodonta, 
had a slightly higher mean OPC than did the ‘plagiaulacids’ but 
retained the low standard deviation of OPC (Fig. 2a), indicating low 
morphological disparity. A distinct break occurred 84-66 Myr ago, in 
the latest Cretaceous, when the mean OPC rose and peaked within the 
Campanian (mean OPC, 145) and maximum OPC and disparity 
sharply increased as well (Fig. 2a; OPC, 70-230). Of the 17 taxa for 
this interval, 5 have OPC values that are greater than 160 and 2 have 
OPC values that are greater than 200, corresponding to values for 
plant-dominated omnivory and herbivory among extant mammals, 
respectively (Fig. 2a). 
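
The two cut-offs quoted above (OPC greater than 160 for plant-dominated omnivory, greater than 200 for herbivory) can be applied as a simple classifier. The sketch below is illustrative: the function name and the example values are hypothetical, and values at or below 160 are lumped into one label because the text gives no exact boundaries for the lower dietary classes.

```python
def plant_diet_class(opc):
    """Classify an OPC value using only the two thresholds quoted in the text."""
    if opc > 200:
        return "herbivory"
    if opc > 160:
        return "plant-dominated omnivory"
    # Lower classes (carnivory, animal-dominated omnivory) are not separated
    # here because no exact cut-offs are given for them.
    return "below plant-dominated range"


# Hypothetical OPC values for a handful of taxa.
example_opc = [109, 145, 170, 230, 348]
labels = [plant_diet_class(v) for v in example_opc]
n_above_160 = sum(1 for v in example_opc if v > 160)
```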

Finally, in the early Paleocene (66-62 Myr ago) multituberculates 
maintained high OPC (mean OPC, 138) and disparity peaked 
(Fig. 2a; OPC, 70-348). The early Paleocene Taeniolabis from North 
America (Fig. 1d) had the highest OPC among multituberculates (OPC, 
348), which exceeded OPC values of extant herbivorous rodents and 
carnivorans'’. Of the 16 other early Paleocene taxa, 4 have OPC values 
greater than 160 and 1 has an OPC value greater than 200 (Fig. 2a). 
Mean OPC and disparity of OPC declined during the remainder of the 
Paleocene and in the early Eocene (62-49 Myr ago). Ectypodus, the 
only known genus from the middle to late Eocene (49-35 Myr ago), has 
a low OPC (109), corresponding to the high end of the range for extant 
mammalian carnivores. This sharp drop in dental ecomorphological 
disparity is intriguing in light of the hypothesis that some late 
Paleocene to early Eocene eutherian lineages (for example, rodents) 
competitively displaced multituberculates. 

Figure 2 | Temporal patterns of multituberculate dental complexity, 
taxonomic richness, body size and angiosperm ecological diversification. 
a, Dental complexity (measured by OPC) for 41 multituberculate genera (solid 
black lines) and estimated for 24 additional genera (dashed black lines; see 
Supplementary Information) with mean dental complexity (OPC; red line) and 
disparity as standard deviations of OPC (blue line) in 5-Myr bins. Lengths of 
horizontal lines represent temporal ranges of taxa or uncertainties in ages of 
fossil localities. Labels for dietary classes are positioned at the lower end of their 
range based on OPC values in modern mammals. Green shaded area 
represents plant-dominated omnivory (PD omni) and herbivory. 
b, Taxonomic richness is equal to the number of genera per 5-Myr bin (blue 
shaded area). c, Body-mass estimates for 156 multituberculate species (solid 
black lines) with geometric means (red line) and disparity (blue line) for each 
5-Myr bin. d, Angiosperm within-flora richness (solid green line) and relative 
abundance (dashed green line) from ref. 24 (+7.5-Myr moving averages). Leaf 
hydraulic capacity of angiosperms as leaf-vein density (green circles) from ref. 
25. The 136-131-Myr bin was excluded from all analyses (Supplementary 
Information). AD omni, animal-dominated omnivory; E, early; Hypercarn, 
hypercarnivory; M, middle; L, late; O, Oligocene; P, Paleocene. 

Variable sampling of fossils through time may influence our 
inferred patterns of multituberculate radiation. In particular, the rela- 
tively depauperate Jurassic and Early Cretaceous taxonomic richness 
of multituberculates may be partly due to the limited fossil record’. To 
test whether the changes in OPC through time might be sensitive to 
uneven sampling of fossils, we randomized the OPC value assignments 
for each genus. The results of 5,000 randomized mean OPC profiles 
through time indicate that even with additional fossil discoveries, the 
OPC patterns and ecological inferences that are presented here are 
likely to be robust (see Supplementary Fig. 8). 
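
One way to picture the randomization test is as a permutation of OPC values among genera, followed by recomputation of the mean OPC per time bin. The sketch below is a guess at the general shape of such a procedure, not the authors' implementation (which is described only in the Supplementary Information); the genus names, bin labels, OPC values and the fixed seed are all hypothetical.

```python
import random
from statistics import mean


def mean_opc_profile(occurrences, opc, bins):
    """Mean OPC of the genera present in each bin (None for empty bins)."""
    profile = []
    for b in bins:
        values = [opc[g] for g, present in occurrences.items() if b in present]
        profile.append(mean(values) if values else None)
    return profile


def randomized_profiles(occurrences, opc, bins, n_iter=5000, seed=0):
    """Shuffle OPC values among genera n_iter times and recompute the profile."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    genera = sorted(opc)
    values = [opc[g] for g in genera]
    profiles = []
    for _ in range(n_iter):
        shuffled = values[:]
        rng.shuffle(shuffled)
        reassigned = dict(zip(genera, shuffled))
        profiles.append(mean_opc_profile(occurrences, reassigned, bins))
    return profiles


# Hypothetical toy data: which bins each genus occurs in, and its OPC.
occurrences = {
    "genus_a": {"bin1", "bin2"},
    "genus_b": {"bin2"},
    "genus_c": {"bin2", "bin3"},
}
opc = {"genus_a": 90, "genus_b": 120, "genus_c": 230}
bins = ["bin1", "bin2", "bin3"]

observed = mean_opc_profile(occurrences, opc, bins)
null_profiles = randomized_profiles(occurrences, opc, bins, n_iter=200, seed=1)
```

Comparing `observed` with the spread of `null_profiles` bin by bin indicates whether the observed trend is distinguishable from one produced by arbitrary assignment of OPC values to genera.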

The overall trend of increasing multituberculate dental complexity 
was driven by increases in the number of cusps per molar and the 
relative size of molars in the cheek tooth row (Supplementary Fig. 1). 
This evolutionary pathway differs from that taken by ungulates, 
primates, rodents and lagomorphs, which responded to increased 
mechanical processing demands (that is, greater plant component) 
by evolving multi-cusped, molar-like premolars in addition to 
complex molars”. Among several other mammalian lineages with 
blade-like premolars (for example, carpolestid primates and potoroid 
marsupials), only sthenurine marsupials re-evolved multi-cusped 
premolars”. Developmental, functional or structural constraints that 
are associated with the evolution of a blade-like premolar may have 
inhibited later evolution of more cuspidate premolars. 

Patterns of generic richness and conservative estimates of body size 
for multituberculates (see Methods Summary) mirror the pattern of 
initially low dental complexity for most of the Mesozoic followed by a 
marked increase just before the end of the Cretaceous (84-66 Myr ago; 
Fig. 2b, c). From the Late Jurassic to the early Late Cretaceous, 
multituberculate generic richness fluctuated between 4 and 16 genera, 
and body mass ranged between 9 g and 105 g. A marked shift occurred 
near the end of the Cretaceous, with generic richness increasing from 
16 to 21 genera and average body mass increasing from 67 g to 194 g; 
in particular, Bubodens magnus reached an estimated 5.25 kg (for a 
comparison, the alpine marmot weighs 3-8 kg). Similarly, Cenozoic 
generic richness and body size patterns parallel changes in dental 
complexity (Fig. 2b, c). However, it should be noted that body size 
and dental complexity are not interchangeable as predictors of diet 
because small multituberculates can have high OPC values, and large 
multituberculates can have low OPC values (Supplementary Informa- 
tion), conforming to previous analyses of extant taxa!®. Furthermore, 
variation in the amount of available fossil-bearing rock through the 
studied interval could contribute to the pattern of generic richness”. 
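
Generic richness per 5-Myr bin amounts to counting the genera whose temporal ranges overlap each bin. The following sketch illustrates that count under stated assumptions: the genus names and ranges are hypothetical, ages are in Myr ago, and a range that only touches a bin boundary is not counted in that bin.

```python
def richness_per_bin(ranges, bin_edges):
    """Count genera per time bin.

    ranges: {genus: (first_appearance, last_appearance)} in Myr ago,
            with first_appearance >= last_appearance.
    bin_edges: bin boundaries in Myr ago, listed from oldest to youngest.
    Returns a list of ((older_edge, younger_edge), count) tuples.
    """
    counts = []
    for older, younger in zip(bin_edges, bin_edges[1:]):
        # A genus is counted when its range overlaps the open interval
        # between the two edges; ranges ending exactly on an edge are
        # not counted in the bin beyond that edge.
        n = sum(
            1
            for first, last in ranges.values()
            if first > younger and last < older
        )
        counts.append(((older, younger), n))
    return counts


# Hypothetical ranges for three genera.
ranges = {"genus_a": (84, 66), "genus_b": (70, 62), "genus_c": (64, 64)}
edges = [86, 81, 76, 71, 66, 61]
richness = richness_per_bin(ranges, edges)
```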

It is worth noting that increases in dental complexity (OPC of greater 
than 160) occurred in five multituberculate lineages: in the Asian 
Djadochtatheroidea, the North American Eucosmodontidae and 
Cimolomyidae (just before the end of the Cretaceous), the North 
American and Asian Taeniolabididae, and the North American 
Microcosmodontidae (in the early Paleocene). In the absence of a 
robust multituberculate phylogeny, it is unclear whether this repre- 
sents a single evolutionary increase in tooth complexity, with multiple 
reversals to low complexity, or parallel increases in individual lineages 
or clades. This increase differs from the more commonly observed 
pattern, in which increased morphologic disparity precedes taxonomic 
diversification”’, as this increase occurred in step with an increase in 
multituberculate body size and taxonomic richness. 

Overall, the pattern of increasing dental complexity in multi- 
tuberculate mammals that pre-dates the Cretaceous—Paleogene mass 
extinction event contrasts with conventional ideas that mammalian 
evolution was suppressed during the Mesozoic era by selective pressures 
imposed by dinosaurs. Instead, multituberculates, the mammalian clade 
that co-existed for the longest time with non-avian dinosaurs, initiated 
an evolutionary radiation during the acme of dinosaur diversity, 
approximately 20 Myr before the Cretaceous—Paleogene boundary”. 
This is consistent with the highly specialized adaptations that are found 
among several new, exceptionally well-preserved Mesozoic mammal 
specimens’ and with the timing of increased molecular divergence 
rates among extant mammalian lineages®’, but the multituberculates 
also show broad taxonomic and ecomorphologic diversification at the 
intraordinal level. 

This adaptive response post-dated the taxonomic radiation of 
angiosperms but coincided broadly with increases in ecological diversity, 
abundance and leaf hydraulic capacities of angiosperms in the Late 
Cretaceous (in the Campanian and Maastrichtian)*’** (Fig. 2d), 
suggesting that there is a causal link. Many angiosperms during this 
time were herbaceous, and had a rapid life cycle and less-effective 
herbivore defences compared to other seed plants and would con- 
sequently have been an attractive, protein-rich food source for 
herbivores”’. Some angiosperms had begun inhabiting a broader range 
of niches (trees, herbs and epiphytes) than most other plant groups and 
this may have enabled greater partitioning of the herbivore niche. 
Therefore, the multituberculate adaptive radiation, suggested by the 
increasing dental complexity, may have been triggered mainly by new 
niche space that was generated through the evolutionary and ecological 
radiations of angiosperms, but may have also been influenced by 
parallel radiations of associated non-angiosperm clades (for example, 
ferns’’), insect pollinators, dispersers and herbivores”*. A trophic link 
between angiosperms and some multituberculates is supported by our 
inferred dietary trend towards increased plant-dominated omnivory 
and herbivory among multituberculates just before the end of the 
Cretaceous and in the early Paleocene. 

Our data also show that the dietary range of multituberculates did 
not decrease in response to the Cretaceous—Paleogene mass extinction 
event (Fig. 2a). Despite substantial taxonomic turnover of multi- 
tuberculates at the Cretaceous—Paleogene boundary”, they seem to 
have experienced little change in available food resources during one 
of the most severe extinction events in Earth’s history. This apparent 
indifference of the multituberculate radiation to the Cretaceous— 
Paleogene event underscores the ecological selectivity of extinctions 
and suggests that broad-scale ecomorphological inventories of radiations 
may help to constrain the possible causes of extinction for other 
groups at the Cretaceous-Paleogene boundary. 


METHODS SUMMARY 


We scanned 48 dentitions from 41 genera of multituberculates. The three- 
dimensional scans of lower cheek tooth rows were analysed using GIS to quantify 
the number of discrete orientation patches using eight orientation directions. The 
mean of repeated measurements for eight rotations at multiples of 5.625° was used 
to reduce the effect of slight variations in the orientation of teeth. We compiled 
generic richness data from recent compendia and the primary literature. We 
estimated body mass from skull length, whenever available, and a tooth size to 
skull length regression formula. Detailed methods and calculations of dental com- 
plexity, generic richness and estimated body mass, data sets, measurements, cal- 
culations, randomization analyses and additional references are provided in the 
Supplementary Information. 
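
The patch-counting step summarized above can be sketched on a gridded height map as follows. This is an illustrative approximation, not the GIS workflow used in the study: the finite-difference slope estimate, the 8-neighbour connectivity, and the treatment of each "rotation" as a shift of the direction-bin boundaries are all assumptions, and the function names are hypothetical. The eight directions, the minimum patch size of three grid points and the 5.625° rotation step follow the text.

```python
import math
from collections import deque


def aspect_grid(heights):
    """Downslope direction in degrees [0, 360) for interior cells.

    Border cells and flat cells get None (no defined orientation).
    """
    rows, cols = len(heights), len(heights[0])
    aspect = [[None] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            dz_dx = (heights[r][c + 1] - heights[r][c - 1]) / 2.0
            dz_dy = (heights[r + 1][c] - heights[r - 1][c]) / 2.0
            if dz_dx == 0 and dz_dy == 0:
                continue
            aspect[r][c] = math.degrees(math.atan2(-dz_dy, -dz_dx)) % 360.0
    return aspect


def count_patches(aspect, offset_deg=0.0, n_dirs=8, min_cells=3):
    """Count connected same-direction patches with at least min_cells cells."""
    rows, cols = len(aspect), len(aspect[0])
    width = 360.0 / n_dirs
    bins = [
        [None if a is None
         else int(((a - offset_deg) % 360.0) // width) % n_dirs
         for a in row]
        for row in aspect
    ]
    seen = [[False] * cols for _ in range(rows)]
    patches = 0
    for r in range(rows):
        for c in range(cols):
            if bins[r][c] is None or seen[r][c]:
                continue
            # Breadth-first flood fill over the 8 neighbouring cells.
            size, queue = 0, deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                size += 1
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and not seen[ny][nx]
                                and bins[ny][nx] == bins[r][c]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            if size >= min_cells:
                patches += 1
    return patches


def opc_rotated(heights, n_rotations=8, step_deg=5.625):
    """Mean patch count over n_rotations shifts of the direction bins."""
    aspect = aspect_grid(heights)
    counts = [count_patches(aspect, offset_deg=k * step_deg)
              for k in range(n_rotations)]
    return sum(counts) / len(counts)
```

As a sanity check, a uniformly tilted plane yields a single patch, and a simple ridge with two opposing slopes yields two.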

Identification and characterization of a bacterial 
hydrosulphide ion channel 


Bryan K. Czyzewski & Da-Neng Wang 


The hydrosulphide ion (HS⁻) and its undissociated form, hydrogen 
sulphide (H₂S), which are believed to have been critical to the origin 
of life on Earth, remain important in physiology and cellular 
signalling. As a major metabolite in anaerobic bacterial growth, 
hydrogen sulphide is a product of both assimilatory and dissimilatory 
sulphate reduction. These pathways can reduce various oxidized 
sulphur compounds including sulphate, sulphite and thiosulphate. 
The dissimilatory sulphate reduction pathway uses this molecule as 
the terminal electron acceptor for anaerobic respiration, in which 
process it produces excess amounts of H₂S (ref. 4). The reduction 
of sulphite is a key intermediate step in all sulphate reduction 
pathways. In Clostridium and Salmonella, an inducible sulphite 
reductase is directly linked to the regeneration of NAD⁺, which 
has been suggested to have a role in energy production and growth, 
as well as in the detoxification of sulphite. Above a certain 
concentration threshold, both H₂S and HS⁻ inhibit cell growth by 
binding the metal centres of enzymes and cytochrome oxidase, 
necessitating a release mechanism for the export of this toxic 
metabolite from the cell. Here we report the identification of a 
hydrosulphide ion channel in the pathogen Clostridium difficile 
through a combination of genetic, biochemical and functional 
approaches. The HS⁻ channel is a member of the formate/nitrite 
transport family, in which about 50 hydrosulphide ion channels 
form a third subfamily alongside those for formate (FocA) and 
for nitrite (NirC). The hydrosulphide ion channel is permeable to 
formate and nitrite as well as to HS⁻ ions. Such polyspecificity can 
be explained by the conserved ion selectivity filter observed in the 
channel's crystal structure. The channel has a low open probability 
and is tightly regulated, to avoid decoupling of the membrane 
proton gradient. 

Because H₂S is a weak acid with a pKa of 6.8 and a second pKa of 19, in 
a biological system more than 80% of it will exist in its ionized form as 
HS⁻ (refs 2, 13). To account for membrane permeation of intracellularly 
produced H₂S (refs 6, 9), a channel for HS⁻ or H₂S has been suggested, 
with the water channel aquaporins as possible candidates. Whereas 
studies in planar lipid bilayers indicate that the aquaporin from 
Archaeoglobus fulgidus is not permeable to H₂S and that the lipid bilayer 
provides little resistance to H₂S permeation, in the vent worm Riftia 
pachyptila it has been shown that HS⁻ ions, but not H₂S, are selectively 
transported through its outer epithelium into its vasculature. 

Despite the high permeability of H₂S across the lipid bilayer, we 
argue that its weak acidity necessitates a release mechanism for its 
ionized form, HS⁻ (Fig. 1a). As a weak acid with a pKa of 6.8, H₂S 
can diffuse readily across the lipid bilayer. Therefore, the distribution 
of H₂S and HS⁻ on the two sides of the cell membrane is directly 
proportional to the pH differential as described by the Henderson- 
Hasselbalch equation. Because cellular pH is kept at neutral levels, 
and extracellular pH is typically one to two units more acidic, at 
equilibrium the HS⁻ concentration will be 10-100-fold greater on 
the inside of the cell than on the outside in the absence of a release 
mechanism for the HS⁻ anions. Therefore, given its significant 
toxicity, it would be beneficial to directly expel this ion across the 
membrane by the quickest mechanism available: an ion channel. 
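
The 10-100-fold figure follows directly from the Henderson-Hasselbalch relation, as the short worked sketch below illustrates. It assumes that neutral H₂S equilibrates freely across the membrane (equal concentration on both sides) while HS⁻ cannot cross; the function names and the example pH values are illustrative, and only the pKa of 6.8 comes from the text.

```python
def ionized_fraction(ph, pka=6.8):
    """Fraction of total sulphide present as HS- at a given pH.

    From Henderson-Hasselbalch: [HS-]/[H2S] = 10 ** (pH - pKa).
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


def hs_inside_outside_ratio(ph_in, ph_out):
    """Equilibrium [HS-]in / [HS-]out when [H2S] is equal on both sides.

    On each side [HS-] = [H2S] * 10 ** (pH - pKa), so the pKa and the
    shared [H2S] cancel and only the pH difference remains.
    """
    return 10.0 ** (ph_in - ph_out)


# Neutral cytoplasm with an exterior one or two pH units more acidic.
ratio_one_unit = hs_inside_outside_ratio(7.0, 6.0)
ratio_two_units = hs_inside_outside_ratio(7.0, 5.0)
```

Under these assumptions a one-unit pH difference gives a roughly 10-fold accumulation of HS⁻ inside the cell and a two-unit difference roughly 100-fold, matching the range quoted above.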
Recently, a novel family of anion channels for short-chain acids has 
been functionally characterized. Members of the formate/nitrite 
transport (FNT) family transport various anions during anaerobic 
bacterial growth. Judging from sequence homology, however, 
only about half of its 2,000 identified members clearly belong to either 
the FocA or the NirC subfamily (Figs 1b, c and Supplementary Figs 1 
and 2). Notably, for both FocA and NirC, each channel gene is 
genetically linked to its reductase partner: focA to the formate reductase 
pflB and nirC to the nitrite reductase nirBD (Fig. 1b). Our phylogenetic 
analysis of bacterial genes suggests the existence of additional 
channel subfamilies in the FNT family (Fig. 1c and Supplementary 
Fig. 1). One such subfamily is a group of ~50 homologous genes, 
temporarily termed FNT3, found in various species of Clostridium that 
are major human pathogens grown under strictly anaerobic conditions. 
As it is linked to a reductase gene, the asrABC operon (Fig. 1b), we 
proposed that FNT3 genes might encode a channel for HS⁻ or related 
ions. AsrABC reduces sulphite (SO₃²⁻) to sulphide (S²⁻), which is 
rapidly converted into H₂S and HS⁻ under physiological conditions. 

Figure 1 | Genetic analyses and functional characterization of FNT3 and the 
asrABC operon. About 50 FNT genes from Clostridium are linked to the 
sulphite reductase gene asrABC, and they form a third subfamily (FNT3) in the 
FNT family. a, Model of the intracellular anion concentrative effect for the weak 
acid H₂S. b, Genomic organization of three types of FNT gene and their 
metabolically related reductase genes. Representatives are shown for focA, nirC 
and FNT3 with their respective linked operons. c, Phylogenetic tree of 474 
bacterial and archaeal members of the FNT family. Branches are coloured on 
the basis of genetic linkage to metabolic enzymes: genes linked to pyruvate 
formate lyase (pflBA) or formate dehydrogenase (fdhAB; fdhA and fdhB also 
known as selA and selD, respectively) are coloured pink, genes linked to nitrite 
reductase (nirBD) are coloured blue and genes linked to sulphite reductase 
(asrABC) are coloured green. The FocA protein in archaea is encoded by fdhC 
(selC). The grey areas represent FNT family members with no assigned function 
based on genetic linkage. d, Bismuth sulphite agar plate assay: vector control 
(top); asrA, asrB and asrC (left); asrA, asrB, asrC and FNT3 (right). 

Salmonella asrABC is the only such operon that has been characterized 
thus far. Its overexpression in Escherichia coli, which lacks this 
biochemical pathway, was previously shown to reduce SO₃²⁻ to H₂S 
(ref. 19). To verify first whether the homologous asrABC operon from 
C. difficile also encodes an SO₃²⁻ reductase, we tested the growth of 
E. coli transformed with the C. difficile asrABC genes, plated on bismuth 
sulphite agar. When C. difficile asrABC was induced, E. coli was able 
to overcome the expected growth inhibition by reducing SO₃²⁻ to H₂S 
(Fig. 1d and Supplementary Fig. 3). Darkening along the edges of the 
colonies, formed by the precipitation of bismuth and iron sulphide, 
further indicated that H₂S gas was being produced. This showed that 
the asrABC genes from C. difficile do indeed encode an SO₃²⁻ 
reductase, which in turn supports the notion that the FNT3 gene 
('FNT3') in the asrABC operon may encode a channel for SO₃²⁻ or 
for its reduced product, HS⁻. 

To determine whether the FNT3 protein from C. difficile functions 
as a channel for HS⁻ or SO₃²⁻, we then carried out whole-cell 
transport, ion-protein binding in solution and transport-inhibition assays 
in reconstituted proteoliposomes. First we transformed sulphide-producing 
Salmonella typhimurium, which lacks an HS⁻ channel, 
with the C. difficile FNT3. In SO₃²⁻-supplemented minimal medium, 
the FNT3-expressing S. typhimurium released a high concentration of 
HS⁻ ions into the culture medium (Fig. 2a and Supplementary Fig. 4). 
This increase in extracellular HS⁻ was specific to HS⁻ produced by the 
endogenous cytoplasmic AsrABC; when the other sulphide-producing 
reductase in the cell, the periplasmic thiosulphate reductase, was 
engaged with its substrate thiosulphate, little effect on the extracellular 
concentration of HS⁻ was observed. This indicates that the measured 
HS⁻ was generated by SO₃²⁻ reduction by the cytoplasmic AsrABC 
and then exported. Such an FNT3-linked increase in HS⁻ production 
in S. typhimurium can be attributed to either increased import of 
SO₃²⁻ or increased export of HS⁻ from the cell. It follows that an 
FNT3 protein could be an ion channel for SO₃²⁻ or HS⁻. 

An ion channel protein is often stabilized by its permeating ions 
through their direct interaction with its selectivity filter. Therefore, 
we tested whether purified FNT3 protein interacts with the monovalent 
HS⁻ or the divalent SO₃²⁻. Using size-exclusion chromatography of 
purified protein samples incubated at different temperatures, we found 
that at pH 8.0 the presence of HS⁻ ions was able to increase the nominal 
melting temperature of the FNT3 protein by 8 °C, whereas SO₃²⁻ had 
little effect (Fig. 2b and Supplementary Table 1). This result is consistent 
with the hypothesis that FNT3 protein functions as an HS⁻ channel but 
is probably impermeable to SO₃²⁻. Notably, three other monovalent 
anions, formate, nitrite and chloride, also increased the melting 
temperature of the FNT3 protein, but to a lesser degree (3-4 °C). 

Figure 2 | Binding and transport activity of HSC in reconstituted 
proteoliposomes. a, Measurements of sulphide concentrations in the media of 
Salmonella transformed with vector control or vector encoding HSC. Minimal 
media were supplemented with either sulphite or thiosulphate to induce the 
production of hydrogen sulphide from either periplasmic thiosulphate 
reductase or cytoplasmic sulphite reductase. OD, optical density. b, Binding of 
detergent-solubilized and purified HSC to various anions was determined by 
using thermostability coupled size-exclusion chromatography. Peak heights of 
recovered samples are plotted against temperature and fitted to a Boltzmann 
sigmoidal model to determine nominal melting temperatures. c, Radiolabelled 
formate uptake in proteoliposomes reconstituted with purified HSC at pH 8.0 
was monitored in a concentrative uptake assay and compared with FocA 
activity or vesicle controls. c.p.m., counts per minute. d, Inhibition of 
radiolabelled concentrative uptake of formate by the addition of various anions 
at increasing concentrations. The graph shows the amount of radiolabelled 
formate measured at the 10-min time point for each concentration of anion 
tested. The concentrations of the competing anions were 0, 0.15, 0.6, 3 and 
15 mM (left to right). Error bars, s.e.m. (N = 3). 

We went on to identify the permeable ions of the FNT3 protein 
in vitro. As channels in the FNT family are expected to have low 
conductance rates, we measured the transport activity of FNT3 
protein using a concentrative uptake assay. Given that HS⁻ ions 
at concentrations needed to set up a sufficient electrochemical gradient 
(~150 mM) are severely disruptive to the membrane, we chose to 
measure the permeability of FNT3 protein using another interacting 
ion, formate (Fig. 2b). Proteoliposomes reconstituted at pH 8.0 from 
purified C. difficile FNT3 protein were found to be permeable to formate, 
although at a lower rate than FocA from Vibrio cholerae (Fig. 2c). This 
permeability to formate by FNT3 protein allowed us to search for other 
ions that directly compete with formate for transport. Indeed, when 
HS⁻ or nitrite was present in the buffer outside the proteoliposomes, 
the uptake of formate was drastically inhibited (Fig. 2d). In the 
physiological sulphide concentration range (1-2 mM), ~70% inhibition was 
observed, whereas at the highest concentration tested (15 mM), HS⁻ 
and nitrite inhibited the uptake of formate by 90% and 70%, respectively. 
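
Percentage inhibition in a competition assay of this kind is conventionally the fractional drop in uptake relative to the competitor-free control. The sketch below shows that arithmetic only; the function name and the count values are hypothetical, and the text does not specify how the reported percentages were normalized (for example, whether a vesicle-only background was subtracted first).

```python
def percent_inhibition(uptake_with_competitor, uptake_control):
    """Percentage reduction in uptake relative to the competitor-free control."""
    if uptake_control <= 0:
        raise ValueError("control uptake must be positive")
    return 100.0 * (1.0 - uptake_with_competitor / uptake_control)


# Hypothetical counts per minute at the 10-min time point.
control_cpm = 2000.0
with_competitor_cpm = 200.0
inhibition = percent_inhibition(with_competitor_cpm, control_cpm)
```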

Such direct competitive inhibition (Fig. 2c), coupled with whole- 
cell transport assays (Fig. 2a) and in vitro binding assays (Fig. 2b), 
collectively indicates that the FNT3 protein from C. difficile is likely 
to be a channel for the hydrosulphide ion, HS⁻. Although the channel 
is also permeable to formate and nitrite ions, the FNT3 gene is directly 
linked to the asrABC sulphite reductase operon (Fig. 1b); we have 
therefore tentatively named the FNT3 protein a hydrosulphide ion 
channel (HSC). A definite understanding of the physiological role of 
these channels will need to be confirmed by in vivo experiments such 
as genetic deletion studies. 

To understand the transport mechanism of the HSC protein from 
C. difficile (Supplementary Fig. 5), we determined its crystal structure 
at high pH (pH 9.0) to a resolution of 2.2 Å (Fig. 3a and Supplementary 
Table 2). The HSC adopts the aquaporin/FocA fold and forms a 
pentamer like FocA, with the five protomers having the same structure 
(Supplementary Figs 6-8). The two-fold inverted symmetry between 
the two halves of the protein is greater than that observed in FocA and 
extends to the pore-lining transmembrane helices TM2b and TM5b 
(Supplementary Fig. 8b), two salt-bridge triads on the periplasmic and 
the cytoplasmic sides (Lys 148-Glu 200-Asn 205 and Lys 16-Glu 81- 
Asn 86) (Fig. 4a-c), and the two short helices parallel to the membrane 
(helix P on the periplasmic surface and helix N at the amino terminus 
on the cytoplasmic surface) (Supplementary Fig. 8b). This helix N, by 
means of the Lys 16-Glu 81-Asn 86 salt-bridge triad, buttresses TM2b 
and TM5a, making the cytoplasmic side of the protein more closed and 
rigid than that in FocA. 
Figure 3 | Structural and functional characterization of the ion permeation 
pathway. a, Structure of a protomer of HSC overlaid with the pore diameter 
calculations made using HOLE, coloured to indicate permeability to water: 
green, permeable; red, impermeable. Transmembrane helix 2 has been removed 
for clarity. b, Pore diameter calculations from HOLE of the crystal structures for 
each of the two permeation pathway mutations, Leu 82 Val and Phe 194 Ile and 
wild type (WT). c, Close-up of the electron density observed at the selectivity 
filter for the Leu 82 Val mutant. The 2Fo − Fc electron density map contoured 
at 1.1σ is coloured purple and the Fo − Fc difference density map contoured at 
3σ is coloured green. d, Relative uptake of proteoliposomes reconstituted with 
purified HSC and pathway mutations. The graph shows the 10-min time point 
of each concentrative uptake experiment. Error bars, s.e.m. (N = 3). 


Figure 4 | Structural and functional characterization of possible gating 
mechanisms. a, Structure of the HSC protomer representing possible gating 
regions. The structure is coloured to show the two-fold inverted topology. 
b, c, The Glu-Lys-Asn salt-bridge triads, related by a pseudo-two-fold 
symmetry, help to stabilize helix P at the periplasmic side (b) and helix N at the 
cytoplasmic side (c) of the protein. The rotamer that each residue adopts is 
conserved. d, Relative uptake of proteoliposomes reconstituted with purified 
HSC and gating mutations. The graph shows the 10-min time point of each 
concentrative uptake experiment. Error bars, s.e.m. (N = 3). 

The pore for ion permeation, which is located at the centre of each 
protomer, contains the selectivity filter formed by a cytoplasmic slit 
and a periplasmic ring in the middle of the membrane (Fig. 3a, b). The 
radius of the cytoplasmic slit is 0.8 Å, which is too small for an HS⁻ ion 
(radius, 1.7 Å; ref. 24) to pass through (Supplementary Figs 9-11). 
Therefore, the HSC protein in the crystal structure is probably in a 
closed state. As expected from the sequence similarity between HSC 
and FocA in the selectivity filter (Supplementary Fig. 2), the positions 
or orientations of key residues changed only slightly in HSC 
(Supplementary Fig. 10). Such conservation explains the permeability 
of HSC to HS⁻, as well as to formate and nitrite. When a residue at the 
cytoplasmic slit or the periplasmic ring is mutated into a smaller 
residue, for example Leu 82 to Val or Phe 194 to Ile, the pore size is 
increased, as shown in the crystal structures of the mutant protein 
(Fig. 3b and Supplementary Table 2), and the formate transport rate 
of the channel is significantly increased (Fig. 3d). We note that in the 
crystal structure of the Leu82Val mutant, electron density was 
observed in the cytoplasmic slit (Fig. 3c). This density, which is greater 
than that of a water molecule—as inferred from a B-factor fourfold 
lower than those of the surrounding atoms—could be accounted for 
by a Cl⁻ or an HS⁻ ion. Because of their similar sizes and chemical 
properties, Cl⁻ and HS⁻ ions often bind to the same site in protein 
crystal structures, as has been observed previously. 

The observed polyspecificity of HSC is typical for anion channels 
and is most probably shared by the FocA and NirC channels as well. 
Such polyspecific anion recognition is at least partly due to the diffuse 
nature of the electron cloud of anions. By contrast, cation channels 
often show a high degree of selectivity, which is made possible by 
the point-like charge properties of such ions. Because different FNT 
channels in this ancient protein family are coupled with their specific 
metabolic pathways, each channel is expected to have its own preference 
for permeating ions, and that preference is probably due to the small 
structural variations in the selectivity filter that we observed here (Sup- 
plementary Fig. 10). 

The expulsion of toxic concentrations of HS⁻ from the cytoplasm 
through HSC needs to be balanced against the potential risk of 
uncoupling the membrane proton gradient. Freely diffusible H₂S 
and other weak, short-chain acids will dissociate into an anion and a 
proton in the cytoplasm (Fig. 1a). As the anion is expelled, there will be 
an intracellular net accumulation of protons. Therefore, the ground 
state of HSC is likely to be closed and its opening tightly regulated. 
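
The dissociation argument above can be made concrete with the Henderson-Hasselbalch relation. The following is a minimal sketch, not part of the original analysis; the pKa value of about 7.0 for H₂S is an assumed textbook approximation, and the function name is hypothetical.

```python
def dissociated_fraction(ph: float, pka: float) -> float:
    """Fraction of a monoprotic weak acid present as its anion at a given pH
    (Henderson-Hasselbalch relation)."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Assumption: approximate first pKa of H2S; not a value taken from the paper.
PKA_H2S = 7.0

# The three pH values are those at which HSC crystal structures were determined.
for ph in (4.5, 7.5, 9.0):
    frac = dissociated_fraction(ph, PKA_H2S)
    print(f"pH {ph}: {frac:.1%} of the acid is present as HS-")
```

Under this assumption, most of the acid entering a near-neutral cytoplasm would dissociate, which is the situation in which exporting the anion leaves protons behind.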

Indeed, the HSC structure we determined at high pH is in its closed 
state (Fig. 3 and Supplementary Figs 9-11). We then considered how 
easily the channel could be opened. Because the homologous FocA 
channel can be gated by an appropriate change in pH or cytoplasmic 
formate concentration, we first investigated possible pH-dependent 
structural changes of HSC by determining its crystal structure at pH 7.5 
(resolution, 3.2 Å) and at pH 4.5 (3.0 Å) (Supplementary Table 2). The 
protein structures obtained at these neutral and low pH values were the 
same as the structure at 2.2 Å and pH 9.0 (Supplementary Fig. 12), and 
the channel remained closed. We then studied the two salt-bridge 
triads, both of which contain a glutamate residue that could potentially 
function as a pH sensor (Fig. 4a–c). When either of the salt bridges was 
broken, by mutating Lys 16 to Ser or Lys 148 to Glu, the HSC crystal 
structure did not change and the channel still remained closed 
(Supplementary Fig. 13 and Supplementary Table 2). Consistent with 
these structural results, the transport rates of these mutants did not 
increase (Fig. 4d). 

To determine whether HSC is gated by anion concentration in a 
similar manner to V. cholerae FocA, in which the competition of 
formate with Thr 90 from the Ω-loop for hydrogen bonding to 
His 208 in the selectivity filter opens the channel, we mutated the 
equivalent threonine in HSC, Thr 84, to an Ala and determined its 
crystal structure at 2.4 Å (Supplementary Table 2). This mutant 
structure did not differ from the closed wild-type channel, and no increase 
in transport was detected (Fig. 4d and Supplementary Fig. 14). Because 
Thr 84, Lys 16 and Lys 148 all line the ion permeation pathway, muta- 
tion of these residues slightly reduced the ion permeation rate of the 
channel. Therefore, we conclude that the opening of HSC is indeed 
tightly regulated and the channel has a much lower open probability 
than FocA. 

The low open probability of HSC observed here is consistent with its 
physiological role in the cell. The opening of HSC probably requires 
significant movement of TM2b. As TM2b is ‘held’ in space by the 
cytoplasmic helix N (Fig. 4a), the movement of the latter, for example 
by direct interaction with a cytoplasmic enzyme in the same metabolic 
pathway, can trigger the opening of the channel. We have no evidence 
for such a hypothesis, and it can be validated only by further experi- 
mentation, which we expect would reveal greater variations in both the 
gating mechanism and the anionic preference of FNT channels. 


METHODS SUMMARY 

Bismuth sulphite agar plate assay. FNT3 was isolated from C. difficile strain 630. 
Escherichia coli BL21(DE3) transformed with vector pCDFduet-1-asrA-asrB, 
pACYCduet-1-asrC, or pACYCduet-1-asrC-FNT3 were plated on bismuth sulphite 
agar and allowed to develop for 48 h. 

Whole-cell in vivo transport. Salmonella typhimurium transformed with pBAD- 
FNT3 were assayed for their ability to generate sulphide from either thiosulphate 
or sulphite using a silver-sulphide probe following modification of a previously 
published protocol. 

Proteoliposome reconstitution and in vitro transport assay. Proteoliposomes 
were formed from E. coli total lipids and Ni²⁺-NTA-purified proteins and assayed 
by a method modified from a previously published concentrative uptake assay. 

Protein-ion interaction assay. Ion-induced thermostability of HSC was monitored 
using analytical size-exclusion chromatography. Purified protein was incubated at 
increasing temperatures and its recovery was monitored by size-exclusion chromato- 
graphy using a high-performance liquid chromatograph. Candidate permeating 
ions were incubated at a concentration of 20 mM and protein was monitored for 
thermostabilization. 

Protein purification and crystallization and structure determination. HSC was 
expressed as a carboxy-terminal FLAG-10×His fusion in E. coli BL21. The protein 
was purified using Ni²⁺-NTA affinity chromatography, followed by size-exclusion 
chromatography. The protein was crystallized by hanging-drop vapour diffusion 
and the crystal structure was solved by molecular replacement. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Phylogenetic analysis. Phylogenetic data for 474 bacterial and archaeal members 
of the FNT family were obtained from the HOGENOM database, aligned using 
the program JALVIEW and plotted using the program ARCHAEOPTERYX. 
Genomic sequences and operon annotations were derived from the NCBI 
GenBank nucleotide database. 

Bismuth sulphite agar plate assay. Gene FNT3 was isolated from C. difficile strain 
630. The genes for C. difficile asrA (NCBI gene ID, 4915353) and asrB (4915352) 
were cloned into pCDFDuet-1 (EMD Biosciences) and the genes for C. difficile 
asrC (4915351) and FNT3 (4915350) were cloned into pACYCDuet-1 (EMD 
Biosciences). Bismuth sulphite agar plates were freshly prepared following the 
manufacturer’s protocol and supplemented with 1 mM isopropyl-β-D-1-
thiogalactopyranoside and the antibiotics chloramphenicol and spectinomycin. 
E. coli BL21(DE3) cells were transformed with the indicated vectors and streaked, 
and colonies were allowed to grow at 37 °C for 48 h. The bismuth sulphite agar 
contained the ATP synthase inhibitor brilliant green and high concentrations of 
SO₃²⁻. When C. difficile asrABC was induced, E. coli were able to overcome the 
growth inhibition by producing ATP through glycolysis by reducing SO₃²⁻ to 
H₂S. 

Whole-cell in vivo transport assay. Salmonella typhimurium LT2 was trans- 
formed with the FNT3 cloned into a pBAD vector (Invitrogen). Cells were adapted 
in MOPS medium supplemented with 10 mM glucose and thiamine at 1 µg ml⁻¹. 
A fraction of each culture (5 ml) was centrifuged and resuspended in fresh media 
supplemented with ampicillin, 0.2% arabinose and either 6 mM sodium sulphite 
or 6 mM sodium thiosulphate. Aliquots were incubated at 37 °C for 6 h and then 
centrifuged to pellet bacteria. Medium was gently poured into an equal volume of 
sulphide antioxidant buffer. Exported sulphide was measured using a combination 
silver sulphide electrode (Cole-Parmer). The concentration of sulphide in the 
medium was determined using standards of known sodium sulphide concentra- 
tions in the same antioxidant buffer. 
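
Converting electrode readings to concentrations through standards can be sketched as a calibration line. This is only an illustration under stated assumptions: the electrode potential is taken to be linear in the logarithm of sulphide concentration (typical for ion-selective electrodes), and all standard concentrations and millivolt readings below are invented, not data from this study.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def make_calibration(standards_mM, readings_mV):
    """Return a function mapping an electrode reading (mV) to concentration (mM),
    assuming the potential is linear in log10(concentration)."""
    slope, intercept = fit_line([math.log10(c) for c in standards_mM], readings_mV)

    def to_concentration(reading_mV):
        return 10.0 ** ((reading_mV - intercept) / slope)

    return to_concentration


# Hypothetical sodium sulphide standards and invented readings.
standards_mM = [0.01, 0.1, 1.0, 10.0]
readings_mV = [-700.0, -729.5, -759.0, -788.5]

to_concentration = make_calibration(standards_mM, readings_mV)
print(f"{to_concentration(-744.0):.3f} mM")
```

Preparing the standards in the same antioxidant buffer as the samples, as described above, is what justifies applying one calibration line to both.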

Protein purification. FNT3 was cloned into a modified pBAD vector (Invitrogen), 
creating a C-terminal TEV-FLAG-10×His fusion protein, and transformed into 
E. coli BL21 pLysS (Sigma-Aldrich) cells. Transformed cells were grown at 
37 °C to an optical density at 600 nm of 1.0 and induced with 0.2% arabinose for 4 h. Cells were collected 
and lysed by passing twice through an Emulsiflex cell disrupter (Avestin). 
Lysate was clarified by a low-speed spin at 12,000g and solubilized with 1% 
β-dodecyl-maltopyranoside (DDM, Anatrace). Solubilized lysate was incubated 
with Ni²⁺-NTA resin (Qiagen) and the bound protein was eluted with buffer 
containing 1.1% β-octyl-glucopyranoside (OG, Anatrace), 50 mM Tris, pH 8.0, 
200 mM NaCl, 300 mM imidazole and 10% glycerol. The protein sample was 
incubated with TEV protease overnight to remove the tags and then underwent 
size-exclusion chromatography in a Superdex 200 in 25 mM Tris, pH 6.8, 100 mM 
NaCl and 1.1% OG. 

Mass spectrometry. The mass of purified HSC was determined using matrix- 
assisted laser desorption/ionization time-of-flight mass spectrometry in the 
laboratory of Dr T. Neubert following published protocols. 

Protein-ion interaction assay. Ion-binding-induced thermostability of purified 
protein samples was measured using size-exclusion chromatography. Aliquots of 
100 µg of Ni²⁺-NTA-purified protein were incubated for 10 min in a thermocycler 
at increasing temperatures. Samples were injected onto a Shodex KW804 analytical 
size-exclusion chromatography column (Thomson) on a high-performance liquid 
chromatograph (Shimadzu) equilibrated in 200 mM Na₂SO₄, 50 mM Tris, pH 8.0, 
3 mM NaN₃ and 0.05% DDM. The height of the monodisperse peak was used to 
quantify the amount of remaining sample. The nominal melting temperature, Tm, 
defined as the temperature at which 50% of the protein remained soluble, was 
calculated by fitting the data to a Boltzmann sigmoidal model in PRISM5 
(GraphPad Software). For screening of various compounds, 20 mM of each sodium 
salt was added before incubation. 
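
The melting-temperature estimate can be illustrated with a small sketch. It is not the PRISM5 procedure used here: a coarse grid search stands in for nonlinear regression, peak heights are assumed to be normalized so that the top and bottom plateaus are 1 and 0, and the data points are synthetic.

```python
import math


def boltzmann(temp, tm, slope, top=1.0, bottom=0.0):
    """Boltzmann sigmoid: fraction of protein remaining soluble at temperature temp.
    With a negative slope the curve falls from top to bottom and equals the
    midpoint (50% remaining) at temp == tm."""
    return bottom + (top - bottom) / (1.0 + math.exp((tm - temp) / slope))


def fit_tm(temps, fractions):
    """Grid search for the (Tm, slope) pair minimizing the sum of squared errors."""
    best = None
    for tm10 in range(200, 1001):      # Tm candidates: 20.0 to 100.0 in 0.1 steps
        tm = tm10 / 10
        for s10 in range(-100, -4):    # slopes: -10.0 to -0.5 (loss on heating)
            slope = s10 / 10
            sse = sum((boltzmann(t, tm, slope) - f) ** 2
                      for t, f in zip(temps, fractions))
            if best is None or sse < best[0]:
                best = (sse, tm, slope)
    return best[1], best[2]


# Synthetic melting curve generated from known parameters (Tm 62.0, slope -3.0).
temps = [30, 40, 50, 55, 60, 65, 70, 80]
fractions = [boltzmann(t, 62.0, -3.0) for t in temps]

tm, slope = fit_tm(temps, fractions)
print(f"Tm = {tm:.1f}, slope = {slope:.1f}")
```

Comparing the fitted Tm with and without a candidate ion gives the thermostabilization shift that the screen looks for.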

Proteoliposome reconstitution and in vitro transport assay. Transport of 
radiolabelled formate was measured in a concentrative uptake assay. 
Escherichia coli total lipids were aliquoted in chloroform and dried under nitrogen. 
Following resuspension in intraliposomal buffer (150 mM sodium formate and 
50 mM Tris, pH 8.0), lipids were overlaid with nitrogen, freeze-thawed ten times, 
vortexed and sonicated to homogeneity. Proteoliposomes were formed by the 
addition of 1% OG and Ni²⁺-NTA-purified HSC at a protein-to-lipid ratio of 
1:10,000 (w/w). OG detergent was removed by incubation with 400 mg ml⁻¹ 
Bio-Beads SM2 (Bio-Rad) overnight. Proteoliposomes were then extruded through a 
400-nm membrane filter and frozen at −20 °C until use. Concentrated re-uptake 
of formate was initiated by buffer-exchanging 200 µl of thawed proteoliposomes 
over 2 ml of G50 Sephadex swollen in extraliposomal buffer (150 mM glutamate, 
50 mM Tris, pH 8.0). Carbon-14-labelled sodium formate (450 µM, American 
Radiolabelled Chemical) was added to the collected proteoliposomes, and the 
assays were terminated by centrifuging a 20-µl sample at 1,000g for 1 min at each 
time point through a G-50 Probequant Micro-column (GE Healthcare) to remove 
the external radiolabelled formate. The amount of transported radiolabelled 
formate was quantified by adding 1 ml of scintillation fluid to each sample and 
emitted photons were counted using a Wallac scintillation counter. For competi- 
tion assays, increasing concentrations of sodium sulphite, sodium nitrite or 
sodium hydrosulphide (0, 0.15, 0.6, 3 and 15 mM), were added before the addition 
of radiolabelled formate. 

Protein crystallization. High-pH, primitive orthorhombic crystals of HSC were 
grown using the vapour diffusion method, by mixing protein at a concentration of 
11 mg ml⁻¹ in a 1:1 ratio with reservoir solution containing 25-27% (v/v) PEG 
400, 100 mM Tris, pH 7.0-9.0, and 100 mM sodium sulphite, with or without 
50 mM sodium hydrosulphide. Medium-pH, monoclinic crystals were grown in 
reservoir solution containing 25-27% (v/v) PEG 400, 100 mM Tris, pH 7.5, and 
100 mM sodium nitrite. Low-pH, C-centred orthorhombic crystals were grown in 
buffer containing 100 mM sodium sulphite, 100 mM sodium nitrite, 8-10% (w/v) 
PEG 6000, 100 mM zinc acetate and 100 mM sodium acetate, pH 4.5. 

Structure determination. X-ray diffraction data were collected at beamlines X25 
and X29 of the National Synchrotron Light Source at Brookhaven National 
Laboratory and beamline 23ID of the Advanced Photon Source at Argonne 
National Laboratory. Images were processed and scaled using the HKL2000 
suite. The structure was solved by molecular replacement using the V. cholerae 
FocA structure (PDB code: 3KLY) as a search model. Models were built using 
PHENIX Autobuild and manually adjusted using COOT. Refinement was 

Crystal structure of a concentrative nucleoside 
transporter from Vibrio cholerae at 2.4 Å 


Zachary Lee Johnson, Cheom-Gil Cheong & Seok-Yong Lee 


Nucleosides are required for DNA and RNA synthesis, and the 
nucleoside adenosine has a function in a variety of signalling 
processes. Transport of nucleosides across cell membranes provides 
the major source of nucleosides in many cell types and is also 
responsible for the termination of adenosine signalling. As a result 
of their hydrophilic nature, nucleosides require a specialized class 
of integral membrane proteins, known as nucleoside transporters 
(NTs), for specific transport across cell membranes. In addition to 
nucleosides, NTs are important determinants for the transport of 
nucleoside-derived drugs across cell membranes. A wide range 
of nucleoside-derived drugs, including anticancer drugs (such as 
Ara-C and gemcitabine) and antiviral drugs (such as zidovudine 
and ribavirin), have been shown to depend, at least in part, on NTs 
for transport across cell membranes. Concentrative nucleoside 
transporters, members of the solute carrier transporter superfamily 
SLC28, use an ion gradient in the active transport of both nucleosides 
and nucleoside-derived drugs against their chemical gradients. The 
structural basis for selective ion-coupled nucleoside transport by 
concentrative nucleoside transporters is unknown. Here we present 
the crystal structure of a concentrative nucleoside transporter from 
Vibrio cholerae in complex with uridine at 2.4 Å. Our functional 
data show that, like its human orthologues, the transporter uses a 
sodium-ion gradient for nucleoside transport. The structure reveals 
the overall architecture of this class of transporter, unravels the 
molecular determinants for nucleoside and sodium binding, 
and provides a framework for understanding the mechanism of 
nucleoside and nucleoside drug transport across cell membranes. 

Humans have three isoforms of concentrative nucleoside trans- 
porter (hCNT), and the substrate specificities and tissue distributions 
of these isoforms are different. Knowledge of the mechanism of 
these transporters would help us not only to understand physiological 
processes associated with nucleosides but also to provide a framework 
for future drug design to improve nucleoside drug delivery. The major 
barrier to achieving a mechanistic understanding is the lack of atomic 
structures that reveal the origins of nucleoside specificity and the 
principles of function of CNTs. For structure determination and func- 
tional studies we chose a CNT homologue from Vibrio cholerae 
(vcCNT) because of its high sequence homology to hCNTs (39% 
identical to hCNT3; Supplementary Fig. 1) and optimal biochemical 
stability. 

To test whether vcCNT can transport nucleosides, and if so what ion 
gradient vcCNT uses, we performed a radioactive nucleoside uptake 
assay using recombinant vcCNT-reconstituted liposomes. Studies have 
shown that the human CNTs use Na⁺ ions and the Escherichia coli CNT 
uses H⁺ for nucleoside transport. Transport activity was measured 
by monitoring the uptake of [5,6-³H]uridine in both the presence and 
the absence of a Na⁺ gradient (Fig. 1a). We chose uridine because all of 
the CNTs that have been characterized so far transport uridine. 
When a Na⁺ gradient was present, vesicles containing vcCNT accumulated 
significantly more radioactive uridine than the control empty 
vesicles. When a Na⁺ gradient was not present (Na⁺ was replaced with 
choline in the solution), uridine uptake was significantly decreased. 

Figure 1 | vcCNT is a Na⁺-coupled nucleoside transporter with a trimeric 
architecture. a, Time course of the uptake of 2.4 µM [5,6-³H]uridine into 
vesicles containing vcCNT in the presence of a Na⁺ gradient (squares, NaCl), in 
the absence of a Na⁺ gradient (triangles, choline chloride), and the control 
empty vesicles in the presence of Na⁺ (diamonds, empty). Results are 
means ± s.d. (n = 3). b, Cartoon representation of the vcCNT trimer viewed 
from the cytoplasm. Individual protomers are coloured blue, red and green. 
c, Cartoon representation for the vcCNT trimer viewed parallel to the 
membrane. The putative membrane bilayer is indicated in c with grey boxes. 
The dimensions of the putative membrane bilayer, extracellular region, and 
intracellular region are shown. 

Further characterization showed that uridine uptake was electrogenic 
and independent of pH gradients (Supplementary Fig. 2). These data 
clearly show that vcCNT, like human CNTs, uses a Na⁺ gradient to 
transport nucleosides. 

We crystallized and solved the structure of vcCNT at 2.4 Å 
(Supplementary Table 1). Experimental phases to 3.5 Å were obtained 
by single anomalous dispersion (SAD) from a platinum-soaked crystal 
(see Methods). The final model contains a single protomer in the 
asymmetric unit and is of good quality with a free R factor of 22.8% 
(Supplementary Table 1). 

vcCNT crystallizes as a homotrimer that is shaped like an inverted 
triangular basin with its mouth facing the intracellular side and a knob- 
like structure facing the extracellular side (Fig. 1b, c). The three-fold 
axis coincides with the crystallographic six-fold axis and is perpendic- 
ular to the membrane. When viewed from the intracellular side, each 
side of the triangle formed by the trimer is about 92 Å, and when 
viewed parallel to the membrane the trimer is about 57 Å in height 
(Fig. 1). The membrane-embedded region lies roughly in the middle of 
the transporter, judging from the positions of three amphipathic helices 
on each protomer. On the basis of the predicted location of the 
membrane bilayer, the mouth of the basin penetrates into the mem- 
brane plane (Fig. 1c and Supplementary Fig. 3). The presence of several 
polar amino acids on the basin surface probably allows bulk aqueous 
solution to reach deep into the membrane bilayer and access this 
surface of the transporter. 

The structure reveals that each protomer contains eight transmembrane 
helices (TM1-TM8), two re-entrant helix-turn-helix 
hairpins (HP1 and HP2) with opposite orientations in the membrane, 
and three interfacial helices (IH1-IH3) that run parallel to the 
membrane (Fig. 2a). The orientation of the structure relative to the 


Figure 2 | Topology and fold of the vcCNT protomer. a, Schematic 
representation of vcCNT topology. The group of helices under the pink 
triangular background is related to the group of helices under the cyan 
triangular background by two-fold pseudo-symmetry, with the symmetry axis 
parallel to the membrane. b, Cartoon representation of the vcCNT protomer 
fold. Only helices comprising the scaffold domain and the group of helices 
under the pink triangular background are labelled. c, As in b but rotated 

membrane is consistent with previous accessibility studies and 
conforms to the positive-inside rule (Supplementary Fig. 4). hCNTs are 
predicted to contain three more amino-terminal transmembrane helices 
than prokaryotic CNTs, suggesting an 11-TM topology for hCNTs. 

Each protomer can be grouped into two subdomains on the basis of 
their locations (at the outer and inner regions) relative to the centre of 
the protomer. TM1, TM2, IH1, EH (a short stretch of extracellular 
helices), TM3 and TM6 are located at the outer part of the transporter, 
and they seem to be important for maintaining the overall architecture 
of the transporter. These helices in the outer region form a scaffold for 
the transporter (hereafter termed the scaffold domain) (Fig. 2). 
Trimerization contacts are mediated by part of the scaffold domain: 
IH1, EH, TM3 and TM6 (Fig. 2b). IH1 is an amphipathic helix 40 Å 
long that is most probably situated at the water/membrane interface, 
and TM6 is about 60 Å long and tilted almost 60° with respect to the 
membrane normal. EH protrudes into the extracellular solution and is 
about 16 Å in length. Because IH1 is involved in trimerization, is constrained 
to be at the membrane/water interface and interacts with 
many TMs (TM2-TM5), it is most probably important in building 
and maintaining the overall structure of vcCNT. 

Surrounded by the scaffold domain, many conserved amino acids 
implicated in nucleoside transport are localized at the inner domain 
(hereafter termed the transport domain). The transport domain is 
composed of two structural groups that are related by an internal 
two-fold pseudo-symmetry. The first group is composed of IH2, 
HP1, TM4a/b and TM5 (pink background in Fig. 2a), and the second 
group is composed of IH3, HP2, TM7a/b and TM8 (cyan background 
in Fig. 2a). These two groups, separated by TM6 in sequence, can be 
superimposed with a root mean squared deviation of 3.3 Å, with the 
two-fold symmetry operator running through the middle of TM6 and 


through 120°. Only TM1, TM6 and helices comprising the group of helices 
under the cyan triangular background are labelled. d, As in c but rotated 
through 120°. The scaffold domain has been removed to show the pseudo-two- 
fold relationship between the two groups of helices, coloured as pink and cyan 
triangular backgrounds. The two-fold symmetry axis runs through the 
nucleoside at the centre of the transporter. 

parallel to the membrane (Fig. 2d). There is no significant amino-acid 
sequence homology between these two groups (roughly 10% sequence 
identity). This two-fold symmetry relationship positions the tips of 
HP1, HP2 and unwound regions of TM4 and TM7 at the centre of 
the transport domain, which is located slightly below the middle of the 
membrane plane (Fig. 2d). Sequence alignment of hCNTs and vcCNT 
reveals high sequence conservation around HP1, HP2 and the two 
unwound helices (TM4 and TM7), indicating the functional import- 
ance of this region (Supplementary Fig. 1). 
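
A percentage sequence identity such as those quoted above can be computed from a pairwise alignment as follows. This is a minimal sketch: the convention of counting only columns where neither sequence has a gap is an assumption (other denominators are in use), and the aligned strings are toy examples rather than vcCNT sequence.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over alignment columns without gaps."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where both sequences have a residue.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# Toy aligned fragments (hypothetical, for illustration only).
print(percent_identity("GQTE-VLA", "GATEKVIA"))
```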

To the best of our knowledge, the overall fold of vcCNT is novel, 
although local structural elements such as helical hairpins and 
unwound helices have been observed previously. 

The crystal structure suggests that vcCNT adopts a trimeric con- 
figuration. To test whether the stoichiometry of vcCNT is trimeric and 
further validate the physiological relevance of our crystal structure, we 
performed structure-guided disulphide bridge crosslinking experi- 
ments (Supplementary Fig. 5). Our cysteine mutants readily form 
disulphide-crosslinked trimers in both detergent micelles and cell 
membranes under oxidizing conditions; therefore our crystal structure 
reflects a physiologically relevant oligomerization state. Given the 
sequence conservation of most of the amino acids involved in trimer- 
ization, we propose that the stoichiometry of both eukaryotic and 
prokaryotic CNT family members is trimeric (Supplementary Fig. 1). 

The structure contains three deep clefts (one per subunit) at the 
intracellular side facing the centre of the trimer (Fig. 3a). A simulated- 
annealing OMIT map clearly shows that the electron density in the 
cleft is that of uridine (Fig. 3b). The bound uridine in the cleft faces the 
intracellular basin of the transporter; however, it is not free to be 

Figure 3 | Nucleoside-binding site and Na⁺-binding site. a, Cut-away 
surface representation of vcCNT viewed parallel to the membrane. The putative 
membrane bilayer is shown by horizontal lines. Uridine is shown as spheres. 
b, The vcCNT protomer viewed from the centre of the trimer. The scaffold 
domain is shown in ribbon representation; the transport domain is shown in 
cartoon representation. The blue mesh, covered by TM6 (green), is an Fo − Fc 
simulated annealing OMIT map, contoured at 4σ, showing density for uridine. 

released into the intracellular solution because TM6 and TM7b partly 
cover the binding site (Fig. 3a, b and Supplementary Fig. 6). 

The nucleoside-binding site is located at the centre of the internal 
two-fold symmetry and lined by the tips of HP1 and HP2 and the 
unwound regions of TM4 and TM7 (Fig. 3b,c). Inspection of the 
interactions between uridine and the binding site shows that many 
polar or charged amino acids from HP1, HP2, TM4 and TM7 interact 
with the uracil base and ribose. HP1 and TM4b are responsible for 
interacting with the uracil base. The side chains of Gln 154, Thr 155 
and Glu 156 from HP1 interact with the uracil base either directly 
(Gln 154) or indirectly through a water molecule (Thr 155 and 
Glu 156; Fig. 3c). Val 188 from TM4b interacts with the uracil base 
by means of van der Waals interactions. The involvement of these 
amino-acid residues in the interactions with the nucleoside is consistent 
with previous mutational studies: the residue corresponding to 
Gln 154, together with that corresponding to Val 188, is important for 
nucleoside specificity of hCNTs; Glu 156 is critical for Na⁺-nucleoside 
coupled transport. 

HP2 and TM7 are responsible for the interactions with ribose. The 
side chains of Glu 332 (HP2), Asn 368 (TM7) and Ser 371 (TM7) 
interact with the ribose either directly (Glu 332, Asn 368 and Ser 371) 
or indirectly through a water molecule (Asn 368). Mutation of the 
residue corresponding to Glu 332 in hCNTs has been shown to have 
significant functional effects on both nucleoside binding and the rate of 
transport. 

Because vcCNT is a Na⁺-coupled transporter, it must contain at 
least one Na⁺-binding site. Initial hints regarding the location of the 
Na⁺-binding site came from an Fo − Fc map that shows a strong peak 


The inset is a zoomed-in image of the simulated annealing OMIT electron 
density. c, The nucleoside-binding site, showing HP1 (red), TM4b (orange), HP2 
(blue) and TM7b (cyan). Hydrogen bonds are shown as dashed lines. d, The 
Na⁺-binding site is located between HP1 (red) and the unwound region of TM4 
(orange). The blue mesh is an Fo − Fc simulated annealing OMIT map, 
contoured at 6σ, showing density for Na⁺. Coordination of the Na⁺ ion is shown 
as dashed lines. e, The Na⁺-binding site is near the nucleoside-binding site. 


Figure 4 | Hypothetical mechanism of nucleoside transport. Only a single 
protomer is shown for simplicity, viewed from the centre of the trimer (same 
orientation as in Fig. 3b). On the left, the transporter adopts an inward-facing 
conformation. On the right, the transporter adopts an outward-facing 
conformation. Uridine is bound between the tips of HP1 and HP2, and TM6 
(grey), serving as a hydrophobic barrier, partly covers uridine. The blue dotted 
lines demarcate the mobile part of the transport domain, and the grey region 
denotes the scaffold domain. Green circles represent sodium ions. A rigid-body 
motion of the nucleoside-binding site across TM6 can expose the binding site to 
the extracellular side. 


(about 8σ) between the tip of HP1 and the unwound region of TM4. 
Further inspection reveals that this site is octahedrally coordinated by 
three backbone carbonyl groups, two side-chain hydroxyl groups and a 
water molecule (Fig. 3d) with the typical distances (about 2.4 Å) for 
Na⁺ coordination by protein. Because it is difficult to distinguish 
Na⁺ from H₂O on the basis of electron density alone, we performed 
the valence test that was used to identify the Na⁺ sites in the crystal 
structure of LeuT. The result (v = 1.02) is consistent with Na⁺ as 
the bound ion in this site. Therefore we suggest that vcCNT has at least 
one Na⁺-binding site at this position between HP1 and the unwound 
region of TM4. 
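
A valence test of this kind can be sketched as a bond-valence sum over the coordinating oxygen atoms. The sketch below is illustrative only: it assumes the empirical power-law form v = Σ (R/R₀)^(−N) with commonly quoted Na⁺-O parameters (R₀ = 1.622 Å, N = 4.29), and the six distances are idealized at 2.4 Å rather than taken from the refined model, so it does not reproduce the reported value.

```python
# Assumed empirical bond-valence parameters for Na+ coordinated by oxygen.
R0_NA_O = 1.622  # reference bond length in angstroms (assumption)
N_NA_O = 4.29    # empirical exponent (assumption)


def bond_valence_sum(distances, r0=R0_NA_O, n=N_NA_O):
    """Sum of bond valences (R / r0) ** -n over ion-to-oxygen distances in angstroms."""
    return sum((d / r0) ** (-n) for d in distances)


# Idealized octahedral site: six oxygen ligands, all at 2.4 angstroms (hypothetical).
ideal_site = [2.4] * 6
print(f"valence sum = {bond_valence_sum(ideal_site):.2f}")
```

A sum close to 1 is what is expected for a monovalent cation such as Na⁺; a water molecule placed at the same site would not satisfy the test in the same way.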

The Na⁺-binding site is near the nucleoside-binding site, but the Na⁺ 
does not interact directly with the nucleoside (Fig. 3e). What is the 
functional role of Na⁺ binding at this site? Because key amino-acid 
residues for the interactions with the nucleoside base are localized on 
HP1 and TM4b, we propose that the role of Na⁺ binding is to bring HP1 
close to TM4 to complete the formation of the nucleoside-binding site. 

In the field of Na⁺-coupled secondary transporters, the prevailing 
model for ion-coupled transport is described by the alternating-access 
mechanism. If we assume that the alternating-access model is used, 
our crystal structure probably represents an inward-facing occluded 
conformation (Fig. 3a and Supplementary Fig. 6). On the basis of our 
crystal structure, previous accessibility studies of hCNTs, and studies 
of the Na⁺-coupled aspartate transporter GltPh, which shares certain 
structural features with vcCNT (Supplementary Fig. 7), we propose a 
hypothetical model to explain Na⁺-coupled nucleoside transport by 
vcCNT (Fig. 4). A rigid-body movement of the transport domain 
across TM6 while the scaffold domain is held in place could permit 
the transition from an inward-facing to an outward-facing conforma- 
tion. Our reasoning in proposing this mechanism is provided in 
Supplementary Information. 

The crystal structure provides an important step towards under- 
standing the mechanism of nucleoside transport by the CNT family. 
However, many important issues remain to be elucidated, including 
the structural basis for nucleoside and nucleoside-drug specificity and 
the conformational changes associated with each step along the trans- 
port cycle. 


METHODS SUMMARY 


vcCNT was expressed in E. coli C41 cells. Cells were lysed and protein was solubilized 
with dodecyl maltoside. Protein was purified by Co²⁺-affinity chromatography 
followed by gel-filtration chromatography in the presence of decyl maltoside and 
uridine. 

vcCNT was crystallized in the presence of 100 mM CaCl₂, 37-42% PEG400 and 
100 mM Tris-HCl pH 9.0 or 100 mM glycine pH 9.5. Platinum derivatives were 
prepared by soaking the crystals in K₂Pt(CNS)₆. A partial poly-Ala model was 
built with phases to 5.0 Å from single isomorphous replacement with anomalous 
scattering (SIRAS). A complete model was built by combining phases from single 
anomalous dispersion using a new platinum derivative with molecular replacement 
phases from the partial model (MR-SAD) to 3.5 Å. Structure refinement was then 
performed against the 2.4-Å native data. The final model is of good quality with 
good Ramachandran statistics (98% favoured and 2% allowed). The model 
contains residues 2-416 (residues 230-240 missing), a uridine, 5 decylmaltosides 
(1 full and 4 partial), a Na⁺ ion and 43 bound waters. Disordered residues Lys 226 
and Glu 345 were modelled as Ala. 

For the flux assay, protein was reconstituted into lipid vesicles and then vesicles 
were diluted into buffer containing either choline chloride or NaCl. Flux was 
initiated by the addition of 2.4 µM [5,6-3H]uridine and 1 µM valinomycin. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


Expression and purification. The gene encoding vcCNT was cloned from Vibrio 
cholerae genomic DNA into a modified pET26 vector that contains a pelB leader 
sequence and a PreScission Protease cleavable His, -maltose-binding protein 
fusion. The original vector was a gift from R. Dutzler (University of Zurich) and 
was further modified. The vector containing vcCNT was expressed in E. coli C41 
(DE3) cells. Cells were lysed with a homogenizer (Avestin) and protein was 
solubilized with 30 mM dodecyl maltoside (DDM) for 2 h at 4 °C. Solubilized 
lysates were spun down to remove the insoluble fraction, and supernatants were 
applied to Talon Co2+ affinity resin. After binding, protein was eluted with 
imidazole and digested overnight with PreScission Protease. The digestion 
mixture was concentrated and applied to a Superdex 200 size-exclusion column 
in the presence of 5 mM decyl maltoside (DM), 1 mM uridine, 150 mM NaCl, 
20 mM Tris-HCl pH 8.0, 2 mM dithiothreitol. Peak fractions corresponding to 
vcCNT were collected for further experiments. 

Crystallization. Initial crystallization conditions were obtained at the high- 
throughput crystallization-screening laboratory at the Hauptman-Woodward 
Institute. Protein was concentrated to about 10 mg ml^-1 and mixed 1:1 with 
crystallization solution containing 100 mM CaCl2, 37-42% PEG400 and 
100 mM buffer. Crystals grew over a wide pH range (5.6-9.5), but data for structure 
solution were collected on crystals grown at pH 9.0 (100 mM Tris-HCl) and 
pH 9.5 (100 mM glycine). Crystals were grown by using the microbatch-under-oil 
method. After 10-14 days, crystals were harvested, transferred to cryo solution 
containing 32.5% PEG400, and flash-frozen in liquid nitrogen. Platinum derivatives 
were prepared by soaking for 2-4 h in 2.5 mM K2Pt(CNS)6 and then transferring 
them to cryo solution and flash-freezing. 

Structure determination. The data were collected on beamlines 22ID, 22BM and 
24ID-C at the Advanced Photon Source. The data were processed with HKL2000 
(ref. 32). Crystals of vcCNT diffract to 2.4 Å Bragg spacings and belong to the space 
group P63. Extensive screening of crystals was performed because roughly 70% of 
crystals are merohedrally twinned, with the twinning operator perpendicular to 
the crystallographic six-fold axis, leading to the apparent space group P6322. We 
found that SeMet-substituted crystals were almost always twinned with significant 
twinning fractions (20-45%), which necessitated heavy-atom-soaked crystals as 
the choice for a de novo phasing method for the vcCNT structure. Initial phases to 
5.0 Å were obtained by single isomorphous replacement with anomalous scattering 
(SIRAS) from a platinum-soaked crystal. Platinum sites were found by 
SHELXD; phasing was calculated to 5.0 Å Bragg spacings by using SOLVE with 
the figure of merit 0.34, and the density was modified by solvent flattening with 
RESOLVE. A partial model was built by manually placing idealized poly-Ala 
helices into the solvent-flattened electron density map by using COOT. After 
further extensive screening of platinum derivatives, we found a derivative that 
diffracted to 3.1 Å with significant anomalous signal but was non-isomorphous 
with any of our native data. Platinum sites were found from an anomalous difference 
Fourier map by using the partial model phases. Combined phases of single 
anomalous dispersion from the platinum derivative and molecular replacement 
from the partial model (MR-SAD) were calculated at 4.0 Å with the figure of merit 
0.35 and extended to 3.5 Å by solvent flattening with PHASER and RESOLVE 
with the use of the PHENIX interface. After iterative cycles of manual adjustment 
of poly-Ala helices and calculation of combined MR-SAD phases, the electron 
density map was of excellent quality and allowed us to place side chains into the 
partial model. We also collected native data to 2.4 Å at a long wavelength (1.6 Å) 
and identified sulphur sites by using an anomalous difference Fourier map, which 
helped guide the model building. After about 70% of manual model building was 
complete, molecular replacement was performed with the partial model against 
the 2.4 Å native data for further model building and refinement. Structure refinement 
was performed with PHENIX. An anomalous difference Fourier map with 
the native data to 2.4 Å collected at a long wavelength (1.6 Å) identified six sulphur 
sites and helped guide the model building. The final model is of good quality with 
Rwork/Rfree = 19.6/22.8% and good Ramachandran statistics (98% favoured and 
2% allowed), and contains residues 2-416 (residues 230-240 missing), a uridine, 5 
DMs (1 full and 4 partial), a Na+ ion and 43 bound waters. 

Vesicle reconstitution and flux assay. Protein was reconstituted into lipid vesicles 
containing 10 mg ml^-1 of 1-Palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine 
(POPE) and 1-Palmitoyl-2-oleoyl-sn-glycero-3-phosphoglycerol (POPG) (3:1 ratio 
of POPE:POPG) at a mass ratio of 1:500 protein:lipid, as described previously. 
Vesicles were reconstituted in the presence of 200 mM KCl, 20 mM HEPES 
pH 7.4, 100 mM choline chloride. Reconstituted vesicles were then flash-frozen 
and thawed three times, then extruded through a 1.0-µm filter with the use of the 
Avanti Mini-Extruder. 

For the flux assay, vesicles were diluted 1:20 into buffer containing 200 mM KCl, 
20 mM HEPES pH 7.4, and either 100 mM choline chloride or 100 mM NaCl. Flux 
was initiated by the addition of 2.4 µM [5,6-3H]uridine and 1 µM valinomycin. All 
experiments were performed in triplicate at 30 °C. Vesicles were harvested on GF/B 
glass microfibre filters (Whatman) and counted by scintillation on the following day. 


Collapse of polar ice sheets during the stage 11 interglacial 


Maureen E. Raymo & Jerry X. Mitrovica 


Contentious observations of Pleistocene shoreline features on the 
tectonically stable islands of Bermuda and the Bahamas have 
suggested that sea level about 400,000 years ago was more than 20 
metres higher than it is today’*. Geochronologic and geomorphic 
evidence indicates that these features formed during interglacial 
marine isotope stage (MIS) 11, an unusually long interval of warmth 
during the ice age’*. Previous work has advanced two divergent 
hypotheses for these shoreline features: first, significant melting of 
the East Antarctic Ice Sheet, in addition to the collapse of the West 
Antarctic Ice Sheet and the Greenland Ice Sheet'*; or second, 
emplacement by a mega-tsunami during MIS 11 (refs 4, 5). Here 
we show that the elevations of these features are corrected downwards 
by ~10 metres when we account for post-glacial crustal subsidence of 
these sites over the course of the anomalously long interglacial. On 
the basis of this correction, we estimate that eustatic sea level rose to 
~6-13 m above the present-day value in the second half of MIS 11. 
This suggests that both the Greenland Ice Sheet and the West 
Antarctic Ice Sheet collapsed during the protracted warm period 
while changes in the volume of the East Antarctic Ice Sheet were 
relatively minor, thereby resolving the long-standing controversy 
over the stability of the East Antarctic Ice Sheet during MIS 11. 

The stability of ice sheets in the face of continuing global warming is 
an issue of significant societal concern. Satellite gravity measurements 
indicate that the Greenland Ice Sheet (GIS) and the West Antarctic Ice 
Sheet (WAIS), the two ice sheets most susceptible to climate change, 
are experiencing a net mass loss*”, with evidence of an accelerating 
pace’ '*. In contrast, the current mass balance of the much larger East 
Antarctic Ice Sheet (EAIS) is uncertain, even in sign®”, though a recent 
study" has inferred EAIS mass loss localized to coastal regions. This 
uncertainty about the stability of the EAIS in a progressively warming 
world has been a key motivation for studies of the palaeoclimate record 
during past warm intervals. 

One such study, a statistical analysis of widely distributed sea-level 
markers related to the last interglacial (MIS 5e; about 120,000 years ago), 
concluded with 95% confidence that eustatic sea level (ESL; defined as 
the globally averaged sea-level change) was >6.6 m higher during MIS 
5e than at the present day, and with 66% confidence that ESL was 
>8.0 m higher (ref. 13). (This inference, higher than earlier estimates (ref. 14), is 
supported by a recent analysis of MIS 5e sea-level records from 
Florida (ref. 15).) Estimates of the ESL rise associated with collapse of polar 
ice sheets range from 3.4 m (ref. 16) to 7 m for the GIS, and from 3.2 m 
(ref. 17) to 5 m for the WAIS, where the upper bounds refer to the 
complete disappearance of the ice sheet. Thus, whereas the estimate of 
peak ESL during MIS 5e implies significant collapse of both the GIS 
and the WAIS, it also implies that the EAIS remained relatively stable. 
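
The bookkeeping behind this inference can be sketched in a few lines. The snippet below is only an illustration: it sums the quoted lower and upper estimates for the GIS and the WAIS and checks whether a given ESL threshold can be met without any EAIS contribution. The function names and the simple additive treatment are assumptions made for this sketch, not part of the published analysis.

```python
# Illustrative sea-level budget (metres of eustatic sea-level rise).
# The ranges are the values quoted in the text; adding them linearly is a
# simplifying assumption made for this sketch.
GIS_RANGE = (3.4, 7.0)   # Greenland Ice Sheet: lower estimate, complete loss
WAIS_RANGE = (3.2, 5.0)  # West Antarctic Ice Sheet: lower estimate, complete loss


def combined_range(*ranges):
    """Sum the lower bounds and the upper bounds of several (low, high) ranges."""
    low = sum(r[0] for r in ranges)
    high = sum(r[1] for r in ranges)
    return round(low, 1), round(high, 1)


def fits_without_eais(esl_threshold, ranges=(GIS_RANGE, WAIS_RANGE)):
    """True if the threshold does not exceed the maximum GIS + WAIS contribution."""
    _, high = combined_range(*ranges)
    return esl_threshold <= high


if __name__ == "__main__":
    print("GIS + WAIS range (m):", combined_range(GIS_RANGE, WAIS_RANGE))
    for threshold in (6.6, 8.0):
        print(threshold, "m explained without EAIS:", fits_without_eais(threshold))
```

Under these assumptions, both MIS 5e thresholds fall inside the combined GIS and WAIS range, which is the sense in which the EAIS can be regarded as relatively stable.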

It is within this context of assessing potential future instability of the 
EAIS that the sea-level highstand features found at ~20 m (here and 
elsewhere, height above present-day sea level is meant) in Bermuda 
and the Bahamas, and which formed during the MIS 11 interglacial 
(~424-395 kyr ago), have taken on great significance. MIS 11 spanned 
two precession cycles and was the longest interglacial of the past 
500 kyr (refs 18, 19), including the current interglacial MIS 1 (Fig. 1) 
and MIS 5e (Supplementary Fig. 4). If the ESL during the MIS 11 
interglacial peaked at a level 20 m higher than today (ref. 1), then at least 
8 m of that rise must have come from melting of the EAIS. Geologic 
evidence for a ~20-m sea-level highstand in Bermuda and the 
Bahamas is convincing. In Bermuda, reasonably well-dated deposits 
with thalassinidean shrimp burrows, foraminifera, and gastropods 
characteristic of littoral and intertidal environments constrain relative 
sea level at 21.3 ± 1.0 m during MIS 11 (refs 2, 3). On Eleuthera, in the 
Bahamas, a gently sloping erosion surface capped with fenestrae-rich 
intertidal beach deposits provides a maximum sea-level estimate of 
20 ± 3 m, and the occurrence of pendant fibrous cements suggests a 
minimum sea level of 17 ± 2 m (we will henceforth quote a sea-level 
estimate of 18.5 ± 3.6 m for this site); multiple dating methods suggest 
that these deposits were formed during MIS 11 (ref. 1). 
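
Two numbers in this paragraph can be checked with simple arithmetic. The sketch below reproduces the Eleuthera estimate as the midpoint of the maximum and minimum sea-level bounds with their errors added in quadrature, and the minimum EAIS contribution as the part of a 20-m rise left over after complete loss of the GIS (7 m) and the WAIS (5 m). The text does not state how the combined Eleuthera value was obtained, so the quadrature rule is an assumption that happens to match the quoted figure; the function names are hypothetical.

```python
import math


def midpoint_with_quadrature(upper, upper_err, lower, lower_err):
    """Midpoint of two bounds, with their errors added in quadrature (assumed method)."""
    mid = (upper + lower) / 2.0
    err = math.sqrt(upper_err ** 2 + lower_err ** 2)
    return mid, err


def minimum_eais_contribution(peak_esl, gis_max=7.0, wais_max=5.0):
    """ESL rise (m) left unexplained after complete loss of the GIS and the WAIS."""
    return max(0.0, peak_esl - (gis_max + wais_max))


if __name__ == "__main__":
    mid, err = midpoint_with_quadrature(20.0, 3.0, 17.0, 2.0)
    print(f"Eleuthera sea-level estimate: {mid:.1f} +/- {err:.1f} m")
    eais = minimum_eais_contribution(20.0)
    print(f"Minimum EAIS contribution for a 20 m peak: {eais:.1f} m")
```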

How do these observations compare to other MIS 11 sea-level 
indicators or proxies? In a recent survey of MIS 11 sea-level records 
worldwide (most of which are located in tectonically active regions), 
Bowen (ref. 5) estimated peak MIS 11 sea level using a range of tectonic uplift 
corrections based on different assumed ages and elevations of MIS 5e 
sea-level markers. If we adopt the recent inference of Kopp et al. (ref. 13) of the age 
and peak magnitude of ESL during MIS 5e as a constraint on the latter, 
then Bowen’s estimates from far-field sites imply that peak sea level 
during MIS 11 was 6-9 m above the present-day value, an estimate 
consistent with marine δ18O isotopic anomalies (ref. 20). However, Bowen 
excluded a number of outliers, including the well-studied Bermuda 
and Bahamas sites, and following McMurtry et al. (ref. 4), he attributed the 
anomalously high elevation of shoreline features at those sites to 
deposition by storms or a mega-tsunami. 

Figure 1 | Comparison of the duration of the MIS 11 and MIS 1 
interglacials. Plot of the LR04 benthic oxygen isotope stack (ref. 28) (left-hand 
vertical axis) over a time window spanning the MIS 11 (blue; bottom time scale) 
and MIS 1 (red; top time scale) interglacials. The mean standard error on δ18O 
in the LR04 stack is 0.06‰ with an age error of ±4 kyr for the intervals 
considered here. The juxtaposition illustrates the significantly longer duration 
of maximum interglacial conditions during MIS 11 relative to MIS 1. ESL 
associated with the model ice history used to calculate GIA effects during MIS 
11 is shown by dashed line (right-hand vertical axis). Note the hiatus in model 
ice volume changes from 410 to 401 kyr ago (black bar). An analogous 
comparison between the duration of MIS 11 with MIS 5e can be found in 
Supplementary Fig. 4. 

The two explanations for the MIS 11 highstand features in Bermuda 
and the Bahamas, namely global sea-level rise associated with significant 
EAIS melting (ref. 1) or mega-tsunami deposition (refs 4, 5), ignore the potential 
signal from glacial isostatic adjustment (GIA). The GIA signal may be 
significant at the two sites, as they both are located on the peripheral 
bulge of the ancient Laurentian ice complex. As a consequence, they 
would be subject to local crustal subsidence and sea-level rise of amplitude 
~1-2 mm yr^-1 during any interglacial (ref. 21). 

To explore the potential contribution from GIA to MIS 11 highstand 
elevations, we compute global sea-level variations over the past 500 kyr 
using a gravitationally self-consistent theory valid for spherically 
symmetric, linear viscoelastic Earth models (ref. 22; see Supplementary 
Information). As an illustration of the physics of interglacial sea-level 
trends, we plot the predicted change in sea level across the warmest, 
most stable interval in MIS 11 (as implied by δ18O records), a 9-kyr 
period spanning 410-401 kyr ago that occurred during the second half 
of MIS 11 (Fig. 2). The predictions adopt an Earth model with upper-mantle 
and lower-mantle viscosities of 5 × 10^20 Pa s and 5 × 10^21 Pa s, 
respectively (model LM; refs 23, 24; see Supplementary Information). 

In the near-field of the former MIS 12 ice sheets, the predicted sea- 
level change is dominated by radial crustal motions. For example, in 
regions once covered by ice sheets, post-glacial rebound of the crust 
produces a sea-level fall of amplitude up to ~1-2 cm yr^-1, leading to a 
net fall over 9 kyr that can exceed 100 m (these amplitudes are well off 
the scale of Fig. 2). Surrounding these regions of uplift are peripheral 
bulges predicted to subside at rates of up to 0.2-0.3 cm yr^-1 during the 
interglacial. This subsidence accounts for the (red) zones of sea-level 
rise in Fig. 2, with maximum amplitude of ~20 m, that encircle the 
Laurentian, Fennoscandian and Antarctic ice complexes. In contrast, a 
sea-level fall of 2-3 m is predicted across most of the far-field of the 
former ice sheets during MIS 11. This fall is due to a combination of 
two processes: (1) deglaciation-induced ocean loading effects, which 
act, near continental margins, to tilt the crust such that continents 
are deformed upward (that is, sea level falls) and offshore regions 
downward (ref. 25); and (2) the redistribution, or syphoning, of water 
from ocean basins towards regions of peripheral bulge and offshore 
subsidence (ref. 21). 

Figure 2 | Predicted sea-level change across the hiatus in ice volume changes 
(410-401 kyr ago) spanned by the model MIS 11 interglacial. The GIA 
calculation is based on the ice history discussed in Supplementary Information 
and the LM viscosity model. The colour scale saturates in regions within the 
near-field of the late Pleistocene ice sheets. 

Following Fig. 2, we conclude that in the absence of interglacial ice 
volume changes and steric effects, sea-level highstands will date to the 
end of the interglacial at sites within the subsiding peripheral bulge of 
major ice centres, and to the beginning of the interglacial over most of 
the far-field (Fig. 3). (Of course, any sea-level signal associated with 
interglacial ice volume variations and with ocean temperature and 
salinity changes will be superimposed on these predicted GIA trends.) 
Consider Bermuda: the predicted relative sea-level history for this site 
(Fig. 3 and Supplementary Fig. 1, solid red line) exhibits a monotonic 
rise across MIS 11, reaching a level 11.9 m above the present-day value. 
The predicted highstand for the Bahamas (dashed red line, Fig. 3), 
lying on the outer flank of the same peripheral bulge, is 7.4 m. It is 
clear that the amplitude of highstands at Bermuda or the Bahamas, 
indeed at all sites within a peripheral bulge, will be a strong function of 
the duration of the interglacial highstand. In this regard, the protracted 
length of MIS 11, relative to the Holocene (Fig. 1) or MIS 5e (Sup- 
plementary Fig. 4), should be manifested by highstands of particularly 
large magnitude. From this it also follows that simple field com- 
parisons of the relative elevations of Holocene, MIS 5e and MIS 11 
sea-level markers on these islands would lead to an erroneous assess- 
ment of the difference in ESL between those times. 
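
To make the dependence on duration concrete, the sketch below multiplies a constant rate by the length of the interglacial hiatus. Treating the rate as constant is a simplification assumed here (the GIA predictions in the text come from a viscoelastic model, in which rates evolve with time); the rates and the 9-kyr duration are those quoted earlier in the text, and the function name is hypothetical.

```python
def accumulated_change_m(rate_cm_per_yr, duration_kyr):
    """Sea-level change (m) from a constant rate (cm/yr) sustained for duration_kyr."""
    years = duration_kyr * 1000.0
    return rate_cm_per_yr * years / 100.0  # convert cm to m


if __name__ == "__main__":
    hiatus_kyr = 9.0
    cases = (
        ("post-glacial rebound, near field", (1.0, 2.0)),
        ("peripheral bulge, maximum rates", (0.2, 0.3)),
        ("sites such as Bermuda and the Bahamas", (0.1, 0.2)),
    )
    for label, rates in cases:
        low, high = (accumulated_change_m(r, hiatus_kyr) for r in rates)
        print(f"{label}: {low:.0f}-{high:.0f} m over {hiatus_kyr:.0f} kyr")
```

Doubling `hiatus_kyr` doubles every result in this constant-rate picture, which is why a protracted interglacial such as MIS 11 is expected to leave particularly high shorelines on a subsiding peripheral bulge.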

This result may be stated with more generality. The elevation of an 
ancient interglacial highstand at any site within a peripheral bulge will 
be governed by the difference in the state of isostatic disequilibrium at 
the end of the ancient interglacial relative to the disequilibrium at the 
present day in that same location. As the MIS 11 interglacial was sig- 
nificantly longer than the current interglacial (or MIS 5e), more sub- 
sidence had occurred by the time the final MIS 11 shoreline indicators 
were emplaced on those islands than during the current (or MIS 5e) 
interglacial. Further, as the current interglacial proceeds, sites like 
Bermuda and the Bahamas will be subject to a continuing sea-level rise 
as the present peripheral bulge continues to subside. Thus, the height of 
these ancient MIS 11 highstand features will progressively fall as time 
progresses (even in the absence of any future change in ice volume). 

By contrast, the highstand amplitude at sites located in the far field 
will depend on the difference in isostatic disequilibrium at the beginning 
of the interglacial relative to the disequilibrium at the present day. 
Thus, perhaps counter-intuitively, the current elevation of these 
highstands does not depend on the duration of the ancient interglacial. 

Figure 3 | Predicted relative sea-level changes across the model MIS 11 
interglacial. The calculations are based on the ice history discussed in 
Supplementary Information and the LM viscosity model. The top two curves 
are predictions for Bermuda (solid red line) and the Bahamas (dashed red line) 
at locations of published data discussed in text. The lower coloured curves, 
predictions for a number of far-field sites, are included for comparison: the Gulf 
of Aden, Red Sea (brown), Curacao (green), Coorong, South Australia 
(magenta), Oahu, Hawaii (blue) and Cape Town, South Africa (cyan). The 
black line shows the ESL variation associated with the adopted ice history (as in 
the dashed blue line in Fig. 1 and the black line in Supplementary Fig. 1) and the 
thin horizontal dashed line references to present-day sea level. 

Global predictions of current MIS 11 shoreline elevations within each 
of the two zones discussed above are shown in Fig. 4a and b: Fig. 4a 
encompasses sites where the highstand occurred at the end of the 
modelled interglacial hiatus in ice mass change (for example, 401 kyr 
ago) whereas Fig. 4b encompasses sites where the highstand occurs at 
the beginning of the modelled hiatus (410 kyr ago). These results 
represent the predicted contribution to observed MIS 11 highstands 
from GIA alone; this contaminating signal should be removed from 
geological observations before they are used to infer the difference in 
ESL (or ice volume) between MIS 11 and the present. 

As noted above, the GIA contribution to the MIS 11 shoreline 
elevations in Bermuda and the Bahamas is predicted to be 11.9 m 
and 7.4 m, respectively (Fig. 3). Correcting the published highstand 
estimates at these sites for these GIA predictions yields residual MIS 11 
sea-level elevations of 9.4 ± 1 m for Bermuda and 11.1 ± 3.6 m in 
Eleuthera (Supplementary Table 1). Although additional field obser- 
vations in the Bahamas could greatly decrease the multi-metre error 
associated with these sea-level estimates, we conclude that no signifi- 
cant contribution from melting of the EAIS is required to explain the 
geologic observations at the two sites. 
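
The correction itself is a subtraction, sketched below with the values quoted in the text. The error on each residual is simply carried over from the field estimate; assigning no uncertainty to the GIA prediction is an assumption of this sketch, and the function and dictionary names are hypothetical.

```python
def gia_corrected(observed_m, observed_err_m, gia_m):
    """Subtract the predicted GIA contribution from an observed highstand.

    The observational error is returned unchanged, because no uncertainty is
    assigned to the GIA prediction in this sketch.
    """
    return round(observed_m - gia_m, 1), observed_err_m


SITES = {
    # site: (observed highstand, quoted error, predicted GIA contribution), in metres
    "Bermuda": (21.3, 1.0, 11.9),
    "Eleuthera": (18.5, 3.6, 7.4),
}

if __name__ == "__main__":
    for site, (observed, error, gia) in SITES.items():
        residual, residual_error = gia_corrected(observed, error, gia)
        print(f"{site}: residual highstand {residual} +/- {residual_error} m")
```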

In our analysis, we implicitly assume that melting of the WAIS and 
the GIS occurred towards the end of the MIS 11 interglacial, such that 
the contribution of the melt event to sea level at Bermuda and the 
Bahamas could be added to a coeval GIA-induced highstand. This 
timing for ice sheet collapse is strongly supported by at least two 
arguments. First, if the collapse took place at the beginning of the 
hiatus in MIS 11 melting (that is, at ~410 kyr ago), then the associated 
ESL rise would have had to be close to 20 m in order to fit the observed 
highstand elevations at Bermuda and the Bahamas. However, in this 
case, the predicted highstand elevation at far-field sites would have 
been even greater than 20 m (that is, an ~20-m melt signal added to 
the GIA-induced highstand of a few metres; Fig. 3), a situation apparently 
ruled out by far-field observations (ref. 5). Second, GIA calculations predict a 
submergence of up to several metres for far-field sites by 401 kyr ago 
(Fig. 3) and an average submergence of ~1 m at the specific far-field 
sites considered by Bowen (ref. 5). Therefore, correcting an inferred sea-level 
peak of 6-9 m for this GIA-induced contamination yields a predicted 
ESL ~1 m higher (that is, 7-10 m), in accord with the residual 
highstand elevations at Bermuda and the Bahamas cited above 
(Supplementary Table 1). 

Figure 4 | Plot of the predicted GIA-induced present-day elevation of MIS 
11 highstands that are currently greater than zero. Elevation is shown for all 
sites in which sea-level peaks either at the end of the hiatus in ice volume 
changes spanned by the model MIS 11 interglacial (a; that is, at 401 kyr ago; see 
also Supplementary Fig. 1) or at the beginning of that hiatus (b; that is, at 
410 kyr ago). 

In the Supplementary Information we present results of sensitivity 
tests related to GIA predictions. First, we extend the duration of the 
MIS 11 hiatus in ice mass change from 9 to 14 kyr. In this case, the GIA- 
induced elevation of MIS 11 shorelines at sites within the Laurentian 
peripheral bulge are predicted to increase by ~1-2 m when viscosity 
model LM is adopted, yielding residual MIS 11 highstand elevations of 
7.0 ± 1 m for Bermuda and 9.9 ± 3.6 m in Eleuthera (Supplementary 
Table 1), and thus lowering the inferred peak ESL during MIS 11 by 
~1-2 m. Second, we considered an alternative viscosity profile, VM2 
(ref. 26), characterized by a lower-mantle viscosity that is a factor of 
two smaller than model LM. In this case, the predicted GIA contri- 
bution to MIS 11 highstand elevations within the peripheral bulge is 
reduced by a factor of ~2. This reduction yields residual (GIA-corrected) 
MIS 11 elevations of 15.0 ± 1 m and 14.3 ± 3.6 m at 
Bermuda and the Bahamas, respectively, values that appear to be at 
odds with lower far-field estimates (ref. 5). Last, we performed a Monte Carlo 
parameter search in which we varied mantle viscosity, lithospheric 
thickness, and the duration of the modelled hiatus in MIS 11 ice 
volume changes, and tested for the consistency of the GIA-corrected 
highstand elevations. This exercise yields a preferred bound on the 
peak ESL during MIS 11 of 6-13 m. 
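
The sensitivity tests can be cross-checked by working backwards from the quoted residuals. The sketch below assumes that every case uses the same observed highstands (21.3 m for Bermuda and 18.5 m for Eleuthera) and recovers the GIA contribution implied by each set of residual elevations. The implied values for the 14-kyr and VM2 cases are not quoted in the text; they are derived here purely for illustration, and the names used are hypothetical.

```python
OBSERVED = {"Bermuda": 21.3, "Eleuthera": 18.5}  # observed highstands (m)

RESIDUALS = {  # GIA-corrected elevations (m) quoted in the text
    "LM, 9-kyr hiatus": {"Bermuda": 9.4, "Eleuthera": 11.1},
    "LM, 14-kyr hiatus": {"Bermuda": 7.0, "Eleuthera": 9.9},
    "VM2": {"Bermuda": 15.0, "Eleuthera": 14.3},
}


def implied_gia(case):
    """GIA contribution (m) implied by observed minus residual elevation."""
    return {site: round(OBSERVED[site] - residual, 1)
            for site, residual in RESIDUALS[case].items()}


if __name__ == "__main__":
    for case in RESIDUALS:
        print(case, implied_gia(case))
```

Under this assumption, the implied contributions for the VM2 case are roughly half those for model LM with a 9-kyr hiatus, in line with the factor of ~2 stated above.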

In summary, observations of MIS 11 highstand features in Bermuda 
and the Bahamas must be corrected for the significant contaminating 
signal of GIA before these data are used to infer peak ESL during the 
MIS 11 interglacial. We conclude that ESL reached ~6-13 m above the 
present-day value in the late stages of MIS 11. It is unlikely that ocean 
thermal expansion contributed more than ~1 m to this inference (refs 13, 27). 
Therefore, although this estimate of peak ESL indicates significant 
collapse of both the WAIS and GIS during the MIS 11 interglacial, it 
rules out any significant melting of the EAIS during this unusually 
prolonged period of ice age warmth. 


Received 30 September 2011; accepted 23 January 2012. 
Published online 14 March 2012. 


1. Hearty, P. J., Kindler, P., Cheng, H. & Edwards, R. L. A +20 m middle Pleistocene sea 
level highstand (Bermuda and the Bahamas) due to partial collapse of Antarctic 
ice. Geology 27, 375-378 (1999). 

2. Olson, S. L. & Hearty, P. J. A sustained +21 m sea level highstand during MIS 11 
(400 ka): direct fossil and sedimentary evidence from Bermuda. Quat. Sci. Rev. 28, 
271-285 (2009). 

3. van Hengstum, P., Scott, D. B. & Javaux, E. Foraminifera in elevated Bermudian 
caves provide further evidence for +21 m eustatic sea level during Marine Isotope 
Stage 11. Quat. Sci. Rev. 28, 1850-1860 (2009). 

4. McMurtry, G. M. et al. Elevated marine deposits in Bermuda record a late 
Quaternary megatsunami. Sedim. Geol. 200, 155-165 (2007). 

5. Bowen, D. Q. Sea level ~400000 years ago (MIS 11): analogue for present and 
future sea-level? Clim. Past 6, 19-29 (2010). 

6. Velicogna, I. & Wahr, J. Measurements of time-variable gravity show mass loss in 
Antarctica. Science 311, 1754-1756 (2006). 

7. Luthcke, S. B. et al. Recent Greenland ice mass loss by drainage system from 
satellite gravity observations. Science 314, 1286-1289 (2006). 

8. Shepherd, A. & Wingham, D. Recent sea-level contributions of the Antarctic and 
Greenland ice sheets. Science 315, 1529-1532 (2007). 

9. Rignot, E. et al. Recent Antarctic ice mass loss from radar interferometry and 
regional climate modelling. Nature Geosci. 1, 106-110 (2008). 

10. Thomas, R. et al. Accelerated sea level rise from West Antarctica. Science 306, 
255-258 (2004). 

11. Chen, J. L., Wilson, C. R., Blankenship, D. & Tapley, B. D. Accelerated Antarctic ice 
loss from satellite gravity measurements. Nature Geosci. 2, 859-862 (2009). 

12. Velicogna, I. Increasing rates of ice mass loss from the Greenland and Antarctic ice 
sheets revealed by GRACE. Geophys. Res. Lett. 36, L19503, 
http://dx.doi.org/10.1029/2009GL040222 (2009). 

13. Kopp, R. E., Simons, F. J., Mitrovica, J. X., Maloof, A. C. & Oppenheimer, M. 
Probabilistic assessment of sea level during the last interglacial stage. Nature 462, 
863-867 (2009). 

14. Jansen, E. et al. in Climate Change 2007: The Physical Science Basis (eds Solomon, 
S. et al.) 433-498 (Cambridge Univ. Press, 2007). 

15. Muhs, D. R., Simmons, K. R., Schumann, R. R. & Halley, R. B. Sea-level history of the 
past two interglacial periods: new evidence from U-series dating of reef corals from 
south Florida. Quat. Sci. Rev. 30, 570-590 (2011). 

16. Otto-Bliesner, B. et al. Simulating Arctic climate warmth and ice field retreat in the 
last interglaciation. Science 311, 1751-1753 (2006). 

17. Bamber, J. L., Riva, R. E. M., Vermeersen, L. L. A. & LeBrocq, A. M. Reassessment of 
the potential sea level rise from a collapse of the West Antarctic Ice Sheet. Science 
324, 901-903 (2009). 

18. Loutre, M. F. & Berger, A. Marine isotope stage 11 as an analogue for the present 
interglacial. Glob. Planet. Change 36, 209-217 (2003). 

19. Rohling, E. J. et al. Comparison between Holocene and Marine Isotope Stage-11 
sea level histories. Earth Planet. Sci. Lett. 291, 97-105 (2010). 

20. McManus, J., Oppo, D., Cullen, J. & Healey, S. in Earth’s Climate and Orbital 
Eccentricity: The Marine Isotope Stage 11 Question (eds Droxler, A. W., Poore, R. Z. & 
Burckle, L. H.) 69-85 (AGU Geophys. Monogr. Ser. 137, 2003). 

21. Mitrovica, J. X. & Milne, G. A. On the origin of postglacial ocean syphoning. Quat. Sci. 
Rev. 21, 2179-2190 (2002). 

22. Kendall, R. A., Mitrovica, J. X. & Milne, G. A. On post-glacial sea level: II. Numerical 
formulation and comparative results on spherically symmetric models. Geophys. J. 
Int. 161, 679-706 (2005). 

23. Lambeck, K., Smither, C. & Johnston, P. Sea level change, glacial rebound and 
mantle viscosity for northern Europe. Geophys. J. Int. 134, 102-144 (1998). 

24. Mitrovica, J. X. & Forte, A. M. A new inference of mantle viscosity based upon a joint 
inversion of convection and glacial isostatic adjustment data. Earth Planet. Sci. Lett. 
225, 177-189 (2004). 

25. Nakada, M. & Lambeck, K. Late Pleistocene and Holocene sea-level change in the 
Australian region and mantle rheology. Geophys. J. Int. 96, 497-517 (1989). 

26. Peltier, W. R. Global glacial isostasy and the surface of the ice-age Earth: the ICE-5G 
(VM2) model and GRACE. Annu. Rev. Earth Planet. Sci. 32, 111-149 (2004). 

27. McKay, N. P., Overpeck, J. T. & Otto-Bliesner, B. L. The role of ocean thermal 
expansion in Last Interglacial sea level rise. Geophys. Res. Lett. 38, L14605, 
http://dx.doi.org/10.1029/2011GL048280 (2011). 

28. Lisiecki, L. E. & Raymo, M. E. A Pliocene-Pleistocene stack of 57 globally distributed 
benthic δ18O records. Paleoceanography 20, PA1003, 
http://dx.doi.org/10.1029/2004PA001071 (2005). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank P. Hearty and D. Bowen for discussions of MIS 11 field 
data, and J. L. Davis for suggestions regarding data analysis. Support for this research 
was provided by NSF-OCE-0825293 and OCE-1202632 (M.E.R.), Harvard University 
(J.X.M.) and the Canadian Institute for Advanced Research (J.X.M.). 


Author Contributions This study was planned, undertaken and written jointly. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to M.E.R. (raymo@ldeo.columbia.edu). 


ARTICLE 


doi:10.1038/nature10892 


Suppression of the antiviral response by 
an influenza histone mimic 

Viral infection is commonly associated with virus-driven hijacking of host proteins. Here we describe a novel 
mechanism by which influenza virus affects host cells through the interaction of influenza non-structural protein 1 
(NS1) with the infected cell epigenome. We show that the NS1 protein of influenza A H3N2 subtype possesses a 
histone-like sequence (histone mimic) that is used by the virus to target the human PAF1 transcription elongation 
complex (hPAF1C). We demonstrate that binding of NS1 to hPAF1C depends on the NS1 histone mimic and results in 
suppression of hPAF1C-mediated transcriptional elongation. Furthermore, human PAF1 has a crucial role in the antiviral 
response. Loss of hPAF1C binding by NS1 attenuates influenza infection, whereas hPAF1C deficiency reduces antiviral 
gene expression and renders cells more susceptible to viruses. We propose that the histone mimic in NS1 enables the 
influenza virus to affect inducible gene expression selectively, thus contributing to suppression of the antiviral response. 


Histones are essential regulators of genome function in eukaryotic cells’. 
The amino-terminal domain of histones (the histone tail) provides a 
scaffold for the assembly of protein complexes controlling gene activity’. 
This role of a histone tail is achieved largely by post-translational histone 
modifications, catalysed by distinct modifying enzymes*. The combina- 
tion of diversely modified histone tails establishes gene-region-specific 
patterns that, upon recognition by histone-binding proteins, contribute 
to the regulation of gene replication’, repair® and transcription’. 

The ability of histone tails to guide gene function indicates the 
possibility of targeted control of gene expression by artificial or 
naturally occurring molecules that can structurally and/or functionally 
mimic the histone tail. The former possibility has been demonstrated 
by studies that show the ability of synthetic compounds to interfere 
with inducible gene expression by abolishing the interaction between 
the acetylated histone H4 and the BET family of transcriptional 
regulators*”. Histone binding to transcriptional regulators could also 
be interrupted by exogenous cell-permeable histone peptides’®"'. This 
approach underscores the possibility of a competition between 
endogenous and exogenous histone tails for the common binding 
partners. Indeed, the histone H3 tail-like sequence (histone mimic) 
within histone methyltransferase G9a can compete, in a modification- 
dependent fashion, for binding to the histone-bound heterochromatin 
protein 1 (HP1)'. Overall, the 3-5-amino-acid-long sequences that 
match various parts of the histone tail can be found in numerous 
eukaryotic and prokaryotic proteins; however, the role of these 
presumptive histone mimics in the regulation of gene activity remains 
unknown. 
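
The notion of short sequences that match parts of the histone tail can be illustrated with a minimal sketch. It assumes exact matching of 3-5-residue windows against the first 20 residues of the histone H3 tail; the function name `shared_kmers` and the toy query sequence are hypothetical, and this is not the screening procedure used in the study. Chemically analogous matches (for example, serine in place of threonine) would need a similarity rule that is not shown here.

```python
# Toy sketch: list short stretches of a query protein that also occur
# in the histone H3 amino-terminal tail (exact matches only).

# First 20 residues of histone H3 (initiator methionine removed).
H3_TAIL = "ARTKQTARKSTGGKAPRKQL"


def shared_kmers(query, reference=H3_TAIL, k_min=3, k_max=5):
    """Return (1-based position in query, k-mer) for every window of
    length k_min..k_max in `query` that occurs exactly in `reference`."""
    hits = []
    for k in range(k_min, k_max + 1):
        # All windows of length k present in the reference tail.
        ref_kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}
        for i in range(len(query) - k + 1):
            kmer = query[i:i + k]
            if kmer in ref_kmers:
                hits.append((i + 1, kmer))
    return hits


if __name__ == "__main__":
    toy_query = "MGSLEARTKPD"  # hypothetical sequence, for illustration only
    for position, kmer in shared_kmers(toy_query):
        print(position, kmer)
```

Exact matching keeps the sketch simple; a real screen would also have to weigh nuclear localization and the unstructured nature of the terminal domain.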


Influenza virus NS1 carries a histone H3-like sequence 

Pathogens have a known ability to interfere with vital processes in the host 
cells by mimicking regulatory components of host protein networks”. In 
searching for naturally occurring histone mimics with a gene regulatory 
capability, we screened in silico for pathogen-derived proteins that have 


a known or predicted capacity to accumulate within the nuclei of 
infected cells, and bear a histone-tail-like unstructured domain at the 
amino or carboxy termini. One of these proteins was found to be 
the influenza A virus NS1 protein that suppresses host response to 
influenza virus’. 

Depending on the viral subtype, NS1 is a 219-237-amino-acid-long 
protein’. NS1 is not essential for formation of the viral particle but is 
critical for counteracting the antiviral cell response’*"». In the absence 
of NS1, influenza is greatly attenuated’*. Suppression of antiviral host 
response by NS1 relies partly on the ability of NS1 to interfere with 
cytosolic signalling processes that regulate the expression of type I 
interferon genes'*'’. Additionally, the NS1 protein can affect host 
gene expression by interfering with RNA splicing and messenger 
RNA export”. 

We found that NS1 of influenza A (H3N2) virus carries a sequence that 
resembles the histone H3 tail. Similar to the histone tail, the C terminus of 
NS1 comprises a non-structured and potentially highly interactive 
domain**”? (Fig. 1a). Although this unstructured C-terminal domain is 
present in most of the human influenza A variants (Fig. 1a), only NS1 
from H3N2 subtype (hereafter defined as NS1) possesses the ARSK 
sequence (amino acids 226-229) that is chemically analogous to the 
ARTK sequence (amino acids 1-4) that comprises the lysine 4 site (H3K4) of histone 
H3 (Fig. 1a). The similarity between histone H3 and NS1 tails is further 
strengthened by the ability of the NS1 tail to serve as a substrate for 
histone-modifying enzymes in vitro. Incubation of the NS1 tail peptide 
with lysine methyltransferase Set1 complex or Set7/9 in the presence of 
S-adenosylmethionine resulted in incorporation of the methyl mark 
into the peptide (Fig. 1b, upper panel). Accordingly, substitution of 
lysine 229 to arginine (K229R) prevented methylation. These results, as 
well as the ability of histone acetyltransferase TIP60—which acetylates 
histone H3 lysine 4 in yeast*°—to acetylate the NS1 peptide at K229 
(Fig. 1b, upper panel), support the histone mimicry by NS1. The ability 
of the H3K4-like sequence within the NS1 tail to serve as a substrate 
Figure 1 | Influenza NS1 contains a histone mimic. a, The homologous 
carboxy-terminal NS1 and the amino-terminal histone H3 sequences are 
shown (red letters). The table displays C-terminal NS1 sequences of the 
influenza A subtypes. b, Methylation or acetylation of the NS1 peptide (top 
panel), the GST-NS1 protein (middle panel) or of the viral NS1 in A549 
infected cells (bottom panel) are shown. KR, NS1 substrates where K229 is 
replaced by arginine. c, Association of the NS1 histone mimic with the hPAF1C 


for chosen histone-modifying enzymes has been further confirmed 
by using recombinant GST fusion NS1 protein (Fig. 1b, middle panel). 
Finally, the H3K4-like site within NS1 could be methylated or 
acetylated in virally infected cells. Using a custom-made antibody 
against dimethylated lysine 229 in NS1 (NS1me2; Supplementary Fig. 1), 
or acetyl-specific antibody, we found the presence of 
methylated and acetylated NS1 in infected human cells (Fig. 1b, lower 
panel). Point mutation of the NS1 gene that causes K229R substitution 
prevented modification (Fig. 1b, lower panel). It is likely that members 
of the H3K4 methyltransferase and acetyltransferase enzymes, that can 
modify NS1 in vitro, are responsible for NS1 modification in vivo. 
However, in view of the known redundancy between multiple H3K4 
modifying complexes****°, identification of a sole NS1 modifying 
enzyme seems unlikely. 


NS1 interacts with transcript-elongating proteins 

Our experiments show the presence of a histone mimic within the NS1 
tail. In histones, the unmodified or modified histone tail contributes to 
the assembly of chromatin protein complexes*’~**. The nature of the 
histone-tail-binding proteins is commonly revealed by the identifica- 
tion of nuclear proteins that bind to immobilized histone peptides 
during affinity purification®’. Using this approach (Supplementary 
Fig. 2), we searched for human nuclear proteins that bind to the NS1 
tail peptide (amino acids 220-230). To control for the specificity of 
binding we used a scrambled peptide with an amino acid composition 
identical to NS1. Mass-spectrometry analysis of NS1-tail-bound 
proteins revealed the association of the NS1 tail with polypeptides that 
belong to the human PAF1 transcription elongation (hPAF1C) and 
CHD1 chromatin remodelling complexes (Supplementary Fig. 2). 
Binding of the NS1 tail to human PAF1, parafibromin (CDC73), 
SNF21 and CHD1 was validated by western blotting (Fig. 1c). In 
subunits and CHD1 in nuclear extracts. In, input material. d, NS1 histone 
mimic binds to CHD1. Unmodified or methylated NS1 (K229) or methylated 
H3 (K4) peptides were incubated with the recombinant CHD1 or the CHD1 
double-chromodomain. Binding to NS1 was revealed by silver or Coomassie 
staining (top and bottom panel, respectively). e, Binding of Flag-tagged 
hPAF1C subunits to NS1 or histone H3 peptides was assessed by western 
blotting (left and right panels, respectively). IP, immunoprecipitation. 


support of the histone mimicry by the NS1 tail, we observed the 
histone-H3-like pattern of the NS1 tail binding to CHD1 in vitro. 
The chromodomain of human CHD1 binds specifically to di- or 
trimethylated lysine 4 of histone H3 (ref. 35). Similarly, the full-length 
CHD1 or the purified double-chromodomain binds selectively to di- or 
trimethylated NS1 peptide, but not to unmodified or acetylated NS1 
tail (Fig. 1d). 

To start assessing the role of the NS1 histone mimic in the regulation 
of chromatin complexes, we focused our attention on hPAF1C. 
We chose this on the basis of several factors. First, hPAF1C is a potent 
regulator of RNA elongation, a co-transcriptional process that contributes 
significantly to pathogen-induced gene expression. 
Second, PAF1C has been implicated in the regulation of stress-induced 
genes in yeast that could be functionally equated to pathogen-induced 
genes in mammalian cells. Third, in contrast to CHD1, which binds 
to the NS1 tail in a methyl-dependent fashion, purified hPAF1C binds 
to unmodified as well as modified NS1 tail (Supplementary Fig. 3a, left 
panel), thus making this interaction potentially more versatile. The 
interaction of hPAF1C with the NS1 tail is sequence-dependent. 
Neither the NS1 tail of the H5N1 influenza virus, the scrambled NS1 
peptide, nor the truncated NS1 tail that lacks the H3K4-like sequence 
was able to bind to hPAF1C (Supplementary Fig. 3b). 

hPAF1C consists of six distinct subunits (PAF1, LEO1, CDC73 (also 
called parafibromin), SKI8 (also called WDR61), CTR9, RTF1) that 
have different roles in the interaction between hPAF1C and its binding 
partners, including RNA polymerase II (RNA Pol II)***?®. To identify 
the primary NS1-tail-interacting hPAF1C subunit(s), we tested 
binding of the NS1 tail to purified recombinant individual hPAF1C 
subunits. We found that human PAF1 is the primary binder of the 
unmethylated or methylated NS1 tail, but not the control peptide that 
lacks the histone mimic (Fig. 1e, left panel). PAF1 does not bind to 
acetylated NS1 tail, thus pointing to a possible in vivo regulation of 
NS1 binding to PAF1 by a methyl/acetyl-switch mechanism. Finally, 
NS1 interacts with PAF1 in vivo. Immunoprecipitation of NS1 or 
PAF1 from infected cell extract revealed the existence of an NS1-PAF1 
complex (Supplementary Fig. 3c). Our findings show that direct 
binding of the NS1 histone mimic to PAF1 is responsible for the 
association of NS1 with hPAF1C. 

Interaction of hPAF1C with the NS1 histone mimic raised the 
possibility of hPAF1C binding to the histone H3 tail. Although the 
interaction of hPAF1C with RNA Pol II and other transcriptional 
regulators is well established, the direct association between 
hPAF1C and histone H3 has not been documented. We found that 
unmodified and methylated histone H3 tail peptides, but not the 
scrambled control peptide, bind to hPAF1C (Supplementary Fig. 3a, 
right panel) or to purified PAF1 protein (Fig. 1e, right panel). Similar to 
the NS1 histone mimic, the acetylated histone H3 tail did not bind to 
either hPAF1C or purified PAF1. 

The ability of homologous tails of NS1 and histone H3 to interact 
directly with hPAF1C indicated a possibility of virus-mediated hijack- 
ing of the transcription elongation machinery from the host genes. We 
found that NS1 accumulates in the nuclei of the infected human cells 
(Supplementary Fig. 4a), where it reaches approximately 5 X 10° 
molecules per nucleus within 12h after infection (Supplementary 
Fig. 4b). The salt-extraction profile of NS1 from nuclei of infected cells 
showed association of NS1 with chromatin (Supplementary Fig. 4c). 
To assess the position of NS1 on chromatin during viral infection, we 
generated a recombinant virus (Flag-NS1) that expresses Flag-tagged 
NS1 protein (Supplementary Fig. 5a, b). Genome-wide analysis of the 
Flag-NS1 binding at 12 h after infection revealed the presence of NS1 
on induced antiviral gene promoters (Supplementary Tables 1 and 2). 
Figure 2 | Functional interaction between NS1 and PAF1 in infected cells. 
a, The ChIP-seq profiles show the distribution of indicated proteins at inducible 
genes before (black line) and after (red line) infection. The induced genes were 
revealed by RNA-seq and ChIP-seq analysis of infected A549 cells 
(Supplementary Tables 1 and 2). TSS and TES, the transcriptional start and end 
sites, respectively. b, The NS1 levels at gene promoters in PAF1- or CHD1- 
deficient cells (blue and green lines, respectively). The scrambled siRNA- 
treated cells (red line) were used as control. The insert shows knockdown of 
PAF1 or CHD1 in A549 cells. c, PAF1, RNA Pol II and H3K4me3 levels at the 
TSS and TES of the induced genes in uninfected (ui) cells, cells infected with the 
wild-type (WT) or PAF1-binding mutant virus (ΔPAF). Data are 
representative of three independent experiments; error bars show the s.e.m. 
In infected cells, NS1 is enriched at promoters characterized by the 
presence of H3K4me3 and RNA Pol II (Fig. 2a). The pattern of NS1 
distribution on its targets parallels the distribution of hPAF1C 
(Fig. 2a). Collectively, these data revealed a strategic position of NS1 
at sites of active antiviral gene transcription. 


NS1 histone mimic controls antiviral gene expression 
The interaction between the NS1 tail and hPAF1C has two complementary 
functions that may support viral infection. The virus probably 
uses the NS1-hPAF1C interaction to target sites of active transcription. 
The short interfering RNA (siRNA)-mediated PAF1 deficiency 
resulted in 30% reduction of NS1 levels at the transcriptional start sites 
of the NS1-bound genes in infected cells (Fig. 2b). The specific role of 
PAF1 in recruitment of NS1 was supported by unaltered NS1 binding 
to its gene targets in cells treated with CHD1-specific siRNA (Fig. 2b). 
These findings revealed a previously unknown ability of the influenza 
virus to use host PAF1 to position viral NS1 protein at sites of active 
gene transcription. 

Once targeted to transcriptionally active loci, as defined by RNA 
Pol II levels cross-referenced to the infected cell transcriptome (Sup- 
plementary Tables 1 and 2), NS1 interferes with PAF1 and RNA Pol II 
abundance at target genes. As exemplified by virus-induced IFIT1 and 
IFI6 genes, infection with the wild-type influenza virus resulted in 
decreased PAF1 and RNA Pol II levels at the transcriptional end sites 
(TES) and to a lesser extent at the transcriptional start sites (TSS) of 
the genes, as compared with PAF1-binding mutant virus (ΔPAF) 
(Fig. 2c). This pattern of PAF1, RNA Pol II and H3K4me3 distri- 
bution is indicative of impaired transcriptional elongation. 

To determine the impact of the NS1 tail on the dynamics of antiviral 
gene transcription, we conducted a genome-wide nuclear ‘run-on’ 
analysis (global run-on sequencing, GRO-seq) of cells infected with 
the wild-type or mutant viruses that cannot bind to human PAF1 
(ΔPAF). In the absence of infection, we observed accumulation of 
RNA species that map to the TSS of antiviral genes and low levels of 
RNAs that map to the gene body (Fig. 3a). Accumulation of TSS- 
associated short RNAs is a common feature of rapidly inducible genes 
that maintain a paused/poised state in the absence of an inducer*’. 
Figure 3 | NS1 suppresses antiviral gene transcription in infected cells. 

a, Left: the GRO-seq profile of inducible RNA transcripts in uninfected (ui) 
A549 cells (black line) or cells infected with wild-type or ΔPAF virus (green and 
red lines, respectively). Right: GRO-seq profile of IFIT1 and IFI6 genes in 
uninfected and infected cells. b, GRO-seq profile of A549-expressed genes that 
are not affected by virus infection (left panel) or of the HPRT1 gene (right 
panel). Reads from either DNA strand are indicated as +/−. The y axes 
display reads per million mapped reads per 25 bp. 
Infection with the ΔPAF virus resulted in a marked increase in levels of 
RNA transcripts that map to the TSS as well as to the downstream gene 
region. The high abundance of transcripts associated with the gene 
body is indicative of effective RNA elongation, which has an essential 
role in regulation of the pathogen-driven host response. Compared 
to cells infected with ΔPAF virus, the cells infected with wild-type virus 
displayed slightly reduced levels of TSS-associated RNAs but greatly 
diminished levels of transcripts associated with the gene body and 3′ 
end region (Fig. 3a). This pattern is indicative of impaired transcriptional 
elongation and is consistent with defective elongation in other 
experimental settings. The differences between the impact of ΔPAF 
and wild-type viruses on the global dynamics of antiviral gene transcription 
are exemplified by differences in transcription of selected 
antiviral genes (Fig. 3a, right panel). Infection with either wild-type 
or ΔPAF virus has no effect on transcriptional dynamics of housekeeping 
genes and genes not affected by viral infection (Fig. 3b). 
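
The GRO-seq tracks are reported in reads per million mapped reads per 25 bp. A minimal sketch of that kind of normalization follows, assuming read positions are already mapped to a single strand of a single region; the function name `rpm_per_bin` and its parameters are hypothetical, and the study's actual processing pipeline is not described in this section.

```python
# Toy sketch: bin mapped read positions into fixed-width windows and
# scale the counts to reads per million mapped reads.

def rpm_per_bin(read_starts, total_mapped_reads, region_start, region_end,
                bin_size=25):
    """Return one value per `bin_size` window of [region_start, region_end),
    expressed as reads per million mapped reads."""
    if total_mapped_reads <= 0:
        raise ValueError("total_mapped_reads must be positive")
    # Number of windows needed to cover the region (last one may be partial).
    n_bins = (region_end - region_start + bin_size - 1) // bin_size
    counts = [0] * n_bins
    for pos in read_starts:
        # Ignore reads that start outside the region of interest.
        if region_start <= pos < region_end:
            counts[(pos - region_start) // bin_size] += 1
    scale = 1_000_000 / total_mapped_reads
    return [c * scale for c in counts]


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    profile = rpm_per_bin([0, 10, 30, 99, 100], 2_000_000, 0, 100)
    print(profile)
```

Scaling by library size is what allows profiles from uninfected and infected samples with different sequencing depths to be drawn on the same axis.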

The ability of NS1 to control directly hPAF1C-dependent tran- 
scriptional elongation was further supported by the analysis of the 
impact of the NS1 tail on hPAF1C-driven RNA elongation in vitro. In 
this assay, a chromatinized DNA template, supplemented with selected 
transcription factors and histone-modifying enzymes, supports 
hPAF1C-mediated transcription in a fashion that faithfully reproduces 
transcriptional elongation in vivo (Fig. 4a). We found that NS1 has a 
negative impact on hPAF1C function (Fig. 4b). The comparison 
between the amount of in vitro synthesized RNA in the absence or 
presence of hPAF1C shows the ability of hPAF1C to boost RNA synthesis 
(Fig. 4b). Addition of purified NS1 (Supplementary Fig. 5c) to 
the transcription reaction reduced the amount of synthesized RNA to 
background levels (Fig. 4b). Addition of purified NS1 that lacks the 
PAF1-binding sequence had no effect on RNA synthesis (Fig. 4b). Both 
wild-type NS1 and NS1 that lacks the PAF1 binding sequence have no 
impact on transcription of non-chromatinized template in vitro 
(Supplementary Fig. 6). This selective and direct impact of NS1 on 
virus-induced gene expression has been further supported by a lack of 
NS1 requirement for the association between the viral and the host 
RNA polymerases in infected cells“**° (Supplementary Fig. 7). 

Our findings provide unambiguous proof for the ability of the NS1 
tail to interfere directly with hPAF1C-driven transcription of antiviral 
genes. 


hPAF1C controls antiviral gene expression and infection 
The significance of NS1-hPAF1C interaction during infection was 
evaluated by two independent experimental approaches. Deletion of 
the hPAF1C binding sequence in NS1 resulted in reduced viral titres 
of the ΔPAF virus as compared to those of the virus carrying full-length 
NS1 (Supplementary Fig. 8). This finding is in agreement with 
the previously reported positive role of the NS1 carboxy terminus in 
influenza A virulence. 

We also revealed the central role of hPAF1C in transcriptional 
response induced by various viruses, as well as by the viral RNA 
analogue poly(I:C) and type I interferon. To study the impact of 
PAF1 on influenza-induced antiviral gene expression we used the 
Figure 4 | NS1 inhibits transcriptional elongation in vitro. a, The full-length 
NS1 protein (NS1) or NS1 lacking the PAF1-binding sequence (NS1(ΔPAF)) 
(Supplementary Fig. 5c) was added to the RNA elongation reaction as 
indicated. b, The amount of the 390-nt RNA elongation product was quantified 
by ImageJ. The results of two independent experiments are shown. 
NS1-deficient influenza virus (PR8/ΔNS1) that elicits an extremely 
potent antiviral response and is therefore particularly useful to study 
host factors that regulate the antiviral response. Infection with PR8/ΔNS1 
of control siRNA-treated cells resulted in a strong upregulation 
of numerous antiviral genes (Fig. 5a). This response was reduced in 
PAF1-deficient cells (Fig. 5a, Supplementary Fig. 9, left panel, and 
Supplementary Table 3). The negative effect of PAF1 deficiency on 
antiviral response is not a consequence of impaired virus life-cycle 
caused by a reduction in viral gene expression (Supplementary Fig. 10). 

PAF1 deficiency also attenuated the transcriptional response to 
wild-type influenza A H1N1 (Supplementary Fig. 11, left panel). The 
general positive role of PAF1 in antiviral gene expression was underscored 
by reduced expression of antiviral genes in PAF1-deficient cells 
infected with vesicular stomatitis virus (VSV) as well as cells treated 
with poly(I:C) (Supplementary Fig. 11 and Supplementary Tables 
4-6). PAF1 deficiency also downregulates the transcriptional response 
to type I interferon that governs the antiviral immunity (Fig. 5b, Sup- 
plementary Fig. 9, right panel and Supplementary Table 7). The 
impaired antiviral response by PAF1-deficient cells contributes to a 
nearly tenfold increase of NS1-deficient PR8/ΔNS1 virus replication, 
[Figure 5 graphic: heat maps of gene expression (colour scale, log2(fold over siCtrl)); tables of affected gene categories with P values (infectious disease, cellular proliferation, immune cell trafficking); plot against time after infection (h).] 

Figure 5 | PAF1 controls antiviral response. a, b, The expression levels of 
mRNAs in influenza infected (a) or IFN-β1-treated (b) control (siCtrl) or 
PAF1-deficient (siPAF) A549 cells. The tables show the siPAF-affected gene 
categories. Ut, untreated with siRNA. c, Dynamics of virus replication in 
control or PAF1-deficient A549 cells. p.f.u., plaque-forming units. Data are 
representative of three independent experiments. Error bars show the s.e.m. 
as compared to infected control cells (Fig. 5c). The impact of PAF1 
deficiency on antiviral gene expression is selective. PAF1 deficiency 
does not alter the expression of housekeeping genes (Supplementary 
Fig. 12 and Supplementary Table 8). 


Concluding remarks 


We have shown that H3N2 influenza A virus interferes with host gene 
expression by exploiting the very basic principles of the epigenetic 
control of gene regulation. By mimicking the histone H3K4 sequence, 
which has a key role in positive regulation of gene transcription, the 
influenza virus gains access to histone-interacting transcriptional 
regulators that govern inducible antiviral gene expression. The 
histone-like unstructured NS1 tails are present in the majority of 
human influenza A isolates, but their primary sequence differs 
between viruses. The presence of potentially reactive amino acid 
residues within the unstructured NS1 tails and the nuclear localiza- 
tion of NS1 predict a multitude of nuclear proteins interacting with 
influenza NS1 in a sequence- and/or modification-specific fashion. 
In extension of the histone code that has provided a conceptual 
framework for understanding chromatin-based gene regulation”, 
we propose the existence of the ‘NS1 code’ that guides influenza virus 
interference with host chromatin. The interference with chromatin 
function may complement other immunosuppressive functions of 
NS1 and provide the influenza virus with an opportunity to affect 
host gene expression in a highly selective fashion by recognizing and 
using epigenetic patterns of the host cells. This mechanism may con- 
tribute to the influenza-variant-specific pathogenesis and disease pro- 
gression that differ between viruses carrying the histone-like NS1 tail 
and those viruses, such as human H1N1 (including pandemic 2009 
virus), that lack the NS1 tail. It is possible that by targeting the 
actively transcribed antiviral genes, the NS1 tail helps influenza to 
balance its virulence against the preservation of the infected cell func- 
tion. Such a mechanism could help the virus to maintain its long-term 
presence within the human population. Our findings could also be 
applied to the development of novel anti-influenza therapies that will 
interfere in a highly selective fashion with the NS1 tail binding to its 
chromatin partners. Finally, identification of hPAF1C as a key mediator 
of antiviral and pro-inflammatory gene response may pave the way for 
the development of anti-inflammatory therapies that, similar to the 
recently described ‘epigenetic’ therapies, will interfere selectively 
with hPAF1C binding to its partners. 


METHODS SUMMARY 


The methylation and acetylation in vitro were performed as described previously. 
The analysis of NS1 methylation or acetylation in vivo was done by 
using custom-made methyl-specific or commercial acetyl-specific antibodies. 
Isolation of the NS1 binding proteins and their analysis by mass-spectrometry and 
western blotting were performed according to a protocol used to identify histone-bound 
proteins. The recombinant hPAF1C and its individual components were 
prepared as described before. Interactions of hPAF1C and its individual 
components with peptides were assessed as reported. The transcriptional 
assays were performed as described. Genome-wide distribution of NS1 and 
other chromatin-bound proteins was assessed by ChIP-sequencing as previously 
described, and analysed using standard bioinformatic approaches. 
Recombinant influenza viruses were generated using standard reverse genetics 
approaches. Infection of cells with viruses was performed according to 
standard methods and viral titres were quantified by plaque assay. Gene 
expression analysis was done by microarray using Illumina Human HT-12 
v4 Expression BeadChips or by RNA-sequencing and verified by quantitative real-time 
PCR. GRO-sequencing was performed as described. All methodological 
details are additionally outlined in Supplementary Information. 

Tissue factor and PAR1 promote microbiota-induced 
intestinal vascular remodelling 


Christoph Reinhardt’, Mattias Bergentall'?, Thomas U. Greiner’, Florence Schaffner*, Gunnel Ostergren-Lundén’”, 
The gut microbiota is a complex ecosystem that has coevolved with 
host physiology. Colonization of germ-free (GF) mice with a micro- 
biota promotes increased vessel density in the small intestine’, but 
little is known about the mechanisms involved. Tissue factor (TF) is 
the membrane receptor that initiates the extrinsic coagulation path- 
way’, and it promotes developmental and tumour angiogenesis**. 
Here we show that the gut microbiota promotes TF glycosylation 
associated with localization of TF on the cell surface, the activation 
of coagulation proteases, and phosphorylation of the TF cytoplasmic 
domain in the small intestine. Anti-TF treatment of colonized GF 
mice decreased microbiota-induced vascular remodelling and 
expression of the proangiogenic factor angiopoietin-1 (Ang-1) in 
the small intestine. Mice with a genetic deletion of the TF cytoplasmic 
domain or with hypomorphic TF (F3) alleles had a decreased 
intestinal vessel density. Coagulation proteases downstream of TF 
activate protease-activated receptor (PAR) signalling implicated in 
angiogenesis’. Vessel density and phosphorylation of the cytoplasmic 
domain of TF were decreased in small intestine from PAR1-deficient 
(F2r−/−) but not PAR2-deficient (F2rl1−/−) mice, and inhibition of 
thrombin showed that thrombin-PAR1 signalling was upstream of 
TF phosphorylation. Thus, the microbiota-induced extravascular 
TF-PAR1 signalling loop is a novel pathway that may be modulated 
to influence vascular remodelling in the small intestine. 

The mammalian intestine is an organ with marked postnatal vascular 
adaptation, which is induced at weaning and coincides with the 
development of an adult microbiota. In agreement with early studies 
showing that the gut microbiota affects vascular remodelling in the 
intestine’, we showed significant increases in villus width in the small 
intestine of conventionally raised (CONV-R) mice in comparison with 
GF mice (Fig. 1a), suggesting a link between vascular remodelling and 
altered villus architecture on colonization. We also showed increased 
staining and messenger RNA levels of the vascular marker platelet- 
endothelial cell adhesion molecule 1 (PECAM-1) in the small intestine 
of both CONV-R and conventionalized (CONV-D; GF mice that had 
been colonized for 14 days with a normal microbiota from a CONV-R 
mouse) mice in comparison with GF mice (Fig. 1b-d). The increased 
vessel density was located to the mid-distal part of the small intestine 
(Supplementary Fig. 1). Staining for the tip-cell marker delta-like ligand 
4 (Dll4) indicated that colonization initially promoted sprouting 
angiogenesis but that the number of tip cells returned to basal levels 
once villus remodelling was complete (Fig. 1e). 

We found increased levels of mRNA for Ang-1 as well as increased 
phosphorylation of the Ang-1 receptor Tie-2 in the small intestine of 
CONV-R in comparison with GF mice (Fig. 1f,g), thus providing a 
potential mechanism for microbiota-induced vascular remodelling. 
Consistent with increased vessel density, vascular endothelial growth 
factor receptor 1 (VEGFR-1) expression was also higher in CONV-R 
mice, but there were no changes in any other components of the VEGF 


pathway (Supplementary Fig. 2). The Ang-1-Tie-2 axis promotes the 
remodelling and sprouting of blood vessels”*. To confirm a role for 
Ang-1 in microbiota-induced vascular remodelling, we injected GF 
mice with the specific Ang-1 inhibitor mL4-3 before and during a 
14-day colonization with a normal gut microbiota and showed 
decreases in Tie-2 phosphorylation and intestinal vessel density 
(Fig. 1h-j). We identified the epithelium as a source of Ang-1, because 
its expression was increased in isolated primary enterocytes from 
CONV-R mice in comparison with those from GF mice (Fig. 1k). 

Angiogenesis is linked to the cellular initiation of coagulation, and 
TF signalling has been shown to modulate angiogenesis**. Because 
bacterial components are known to stimulate the coagulation system’, 
we speculated that TF could have a function in microbiota-induced 
angiogenesis in the intestine. In agreement with earlier studies of TF 
localization in humans’? and mice", we identified TF predominantly 
in enterocytes of the villi of small intestine in both GF and CONV-R 
mice (Supplementary Fig. 3). We injected GF mice with anti-TF 
antibody or control IgG before and at 4 and 9 days after colonization 
with a normal caecal microbiota. Tissues were harvested 14 days after 
colonization, and we confirmed that the injected antibodies localized 
to the small intestine (Supplementary Fig. 4). Anti-TF treatment did 
not affect PECAM-1 staining in GF mice that were not colonized 
(Fig. 1l, m); neither were levels of VEGF-A, VEGFR-2 or VEGFR-3 
mRNA in CONV-D mice affected (Supplementary Fig. 5). However, 
anti-TF treatment decreased villus width (Fig. 1n), vessel density 
(Fig. 1o, p) and expression of PECAM-1 and Ang-1 mRNA (Fig. 1q, r) 
in CONV-D mice, suggesting that TF promotes microbiota-induced 
remodelling of the villus vasculature. Similarly, the vessel density 
was decreased in intestines from mice expressing low levels of 
human TF (low-TF mice)? compared with mice expressing normal 
levels of human TF from a knocked-in minigene’’? (Supplemen- 
tary Fig. 6). Because neither humanized mouse strain expressed 
alternatively spliced TF, intestinal vascular remodelling seems to be 
independent of alternatively spliced TF™. 

Paneth cells have been suggested to regulate microbiota-induced 
intestinal angiogenesis in mice, but they also have a large effect on 
angiogenesis independently of colonization status. Anti-TF treatment
did not decrease the number of Paneth cells or mRNA levels of Paneth
cell-derived cryptdin 2 in CONV-D mice (Supplementary Fig. 7), indicating
that treatment with antibody has no cytotoxic effect on Paneth
cells. In addition, vessel density was similar in colonized CR2-tox176
transgenic mice, which lack Paneth cells, and their wild-type
littermates after treatment with anti-TF (Supplementary Fig. 8). 

Next we investigated whether intestinal TF expression and activity 
differed between GF and CONV-R mice. We did not observe any 
differences in intestinal levels of mRNA for TF from the two groups 
of mice (Fig. 2a). In contrast, immunoblot analyses identified two TF- 
reactive bands, one with an apparent molecular mass of 33 kDa that 
Figure 1 | TF promotes microbe-induced vascular remodelling in the gut.
a, Villus width of sections of small intestine from GF and CONV-R mice (n = 4 
mice per group). b, PECAM-1 staining (red) of sections of small intestine from 
GF, CONV-R and CONV-D mice. Nuclei were stained with Hoechst nuclear 
dye (blue). c, Quantification of b (n = 7 or 8 mice per group). d, Relative levels 
of mRNA for the vascular marker PECAM-1 in GF, CONV-R and CONV-D 
mice (n = 6 or 7 mice per group). e, Dll4 staining (green) of sections of small 
intestine from GF, ex-GF mice colonized for 3 days (3d), and CONV-R mice. 
Endothelial cells were stained with PECAM-1 (red). Dll4-positive endothelial 
cells per 100 villi were quantified (n = 3 mice per group). f, Relative levels of 
mRNA for Ang-1 in sections of small intestine from GF and CONV-R mice 
(n = 7-11 mice per group). g, Anti-phospho-Tie-2 immunoblot (Y1100 
phosphorylation site) and quantification relative to total Tie-2 of small- 
intestinal lysates from GF and CONV-R mice (n = 5 mice per group). 

h, PECAM-1 staining of sections of small intestine from mice treated with 
control NaCl solution or the Ang-1 neutralizing peptibody mL4-3. 

i, Quantification of h (n = 6 mice per group). j, Relative levels of mRNA for 


was present in both groups and a second band with an apparent 
molecular mass of 46 kDa that was present at higher levels in intestinal 
lysates from CONV-R and CONV-D mice (Fig. 2b, c). The gut microbiota
has global effects on protein glycosylation in the small intestine,
which is necessary for the correct cellular localization and function of 
PECAM-1 in sections of small intestine from CONV-D mice treated with NaCl
control or mL4-3 (n = 10 or 11 mice per group). The inset shows that mL4-3 is 
a potent inhibitor of Ang-1-mediated Tie-2 phosphorylation. k, Relative levels 
of mRNA for Ang-1 in primary enterocytes from GF and CONV-R mice 

(n = 10 or 11 mice per group). l, PECAM-1 staining of sections of small
intestine from GF mice treated with control or anti-TF antibody.

m, Quantification of l (n = 6 or 7 mice per group). n, Villus width of sections of
small intestine from CONV-D mice treated with control or anti-TF antibody
(n = 4 mice per group). o, PECAM-1 staining of sections of small intestine
from CONV-D mice treated with control or anti-TF antibody.

p, Quantification of o (n = 7 mice per group). q, r, Relative levels of mRNA for
PECAM-1 (q) and Ang-1 (r) in small intestine from CONV-D mice treated
with control or anti-TF antibody (n = 5 or 6 mice per group). Female Swiss
Webster mice or cells isolated from these mice were analysed in all panels. Scale
bars, 50 µm. Results are shown as means ± s.e.m. Asterisk, P < 0.05; two
asterisks, P < 0.01; three asterisks, P < 0.005; n.s., not significant.


many proteins including TF procoagulant activity. We therefore
speculated that the 46-kDa TF band resulted from microbiota-induced
N-linked glycosylation of TF, the primary carbohydrate
modification of TF. In agreement with this, the mannose-binding
lectin concanavalin A readily detected the 46-kDa form of TF in 
Figure 2 | The gut microbiota increases TF procoagulant activity and cell-surface
localization. a, Relative levels of mRNA for TF in sections of small
intestine from GF and CONV-R mice (n = 7-11 mice per group). b, Anti-TF
immunoblot of small-intestinal lysates from GF, CONV-R and CONV-D mice.
c, Quantification of the 46-kDa TF band shown in b (n = 14-25 mice per
group). Data are normalized to actin and expressed relative to GF. d, Anti-TF
immunoblot of primary enterocytes (from GF and CONV-R mice) after 2 h of
culture. e, Anti-TF immunoblots from N-hydroxysuccinimido-biotin-labelled
primary enterocytes from GF and CONV-R mice. Left: pull-down of proteins
located on the plasma membrane with NeutrAvidin beads. Right: supernatant
containing unlabelled proteins. f, Factor Xa activity in small-intestinal lysates
from GF and CONV-R mice treated with control or anti-TF antibody (1H1;
n = 4 or 5 mice per group). g, Levels of thrombin-antithrombin (TAT)
complexes in small-intestinal lysates from GF and CONV-R mice (n = 7 mice
per group). Female Swiss Webster mice or cells isolated from these mice were
analysed in all panels. Results are shown as means ± s.e.m. Asterisk, P < 0.05;
three asterisks, P < 0.005; n.s., not significant.


small-intestinal lysates from CONV-D mice (Supplementary Fig. 9a). 
Treatment with the N-glycosidase PNGase F abolished detection of the 
46-kDa form and generated a partly deglycosylated form with 
increased electrophoretic mobility that was only weakly detected by 
concanavalin A. We also treated primary enterocytes from CONV-R 
mice with the N-glycosylation inhibitor tunicamycin and observed a 
decreased abundance of the 46-kDa form (Supplementary Fig. 9b, c). 
These findings indicate that the gut microbiota promotes N- 
glycosylation of TF. 

Exposure of functional TF on cell surfaces is regulated by basolateral 
sorting in epithelial cells. Surface biotinylation followed by biotin
pull-down of primary enterocytes from GF mice showed that the 
underglycosylated TF was mainly intracellular (Fig. 2d, e). In contrast, 
surface labelling of proteins or carbohydrates showed that enterocytes 
from CONV-R mice had high levels of the fully glycosylated TF on the 
cell surface (Fig. 2d,e and Supplementary Fig. 10). Confocal micro- 
scopy confirmed plasma membrane localization of TF in primary 
enterocytes isolated from CONV-R mice (Supplementary Fig. 11 
and Supplementary Movie). These changes were associated with 
enhanced coagulation activation, as demonstrated by increased TF-FVIIa-dependent
generation of coagulation factor Xa and higher levels
of thrombin-antithrombin complexes in lysates of small intestine 
from CONV-R in comparison with those from GF mice (Fig. 2f,g). 

Not only does TF initiate coagulation, it also interacts with integrins 
on the extracellular side and regulates integrin function through its 
cytoplasmic domain. Proximity ligation and immunoprecipitation
experiments showed increased TF-β1 integrin complex formation
in intestinal tissue from CONV-R mice in comparison with GF
counterparts (Supplementary Fig. 12a, b). Furthermore, TF-β1 integrin
complex formation was decreased by treating CONV-R mice with 
tunicamycin (Supplementary Fig. 12c). The cytoplasmic domain of TF
contains a conserved Ser/Thr-Pro phosphorylation site. Phosphorylation
of this domain has been observed at sites of neovascularization; it
requires surface localization of TF and regulates integrin function.
An antibody directed against the phosphorylated domain of mouse TF 
detected increased phosphorylation in the 46-kDa form of TF in lysates 
of small intestine from CONV-R in comparison with that from GF 
mice (Fig. 3a), and treatment of primary enterocytes from CONV-R 
mice with tunicamycin decreased levels of the phosphorylated 46-kDa 
form of TF (Fig. 3b, c). 

To test directly whether the TF cytoplasmic domain was involved in 
vascular remodelling, we analysed small-intestinal tissue from mice with 
a targeted deletion of this domain (ΔCT mice) and age-matched wild-type
mice. ΔCT mice had significantly decreased villus vascularization
(Fig. 3d, e) and decreased expression of mRNA for PECAM-1 and
Ang-1 in comparison with wild-type mice (Fig. 3f). TF from wild-type
and ΔCT mice had similar electrophoretic mobilities (Supplementary
Fig. 13). These data show that the TF cytoplasmic domain has a
function in increasing vessel density in the small intestine but that it is 
not required for glycosylation. Anti-TF treatment decreased TF 
phosphorylation but not total TF levels in CONV-D mice (Supplemen- 
tary Fig. 14). These results indicate that the inhibitory effects of anti- 
TF on vascular remodelling are independent of TF downregulation 
but, at least in part, involve inhibition of TF cytoplasmic domain 
phosphorylation. 

TF also mediates signalling through coagulation proteases that 
activate the G-protein-coupled receptors PAR1 and PAR2 (refs 4, 5).
We investigated the effect of the gut microbiota on PAR expression
in small-intestinal tissue. Levels of mRNA for PAR1 but not those
for PAR2 were increased in CONV-R mice in comparison with GF
Figure 3 | The gut microbiota increases phosphorylation of the cytoplasmic
tail of TF, which increases vessel density in the intestine. a, b, Anti-phospho-TF
immunoblot of small-intestinal lysates from GF and CONV-R mice (a) and
primary enterocytes (from CONV-R mice) incubated for 2 h in the absence and
presence of tunicamycin (10 µmol l⁻¹) (b). c, Quantification of b (n = 5 mice
per group). d, PECAM-1 staining (red) of sections of small intestine from
10-12-week-old wild-type (WT) and ΔCT female mice on a C57BL/6J genetic
background. Nuclei were stained with Hoechst nuclear dye (blue).

e, Quantification of d (n = 4-6 mice per group). f, Relative levels of mRNA for
PECAM-1 and Ang-1 in segments of small intestine from WT and ΔCT mice
(n = 3 or 4 mice per group). Female Swiss Webster mice or cells isolated from
these mice were analysed in a-c. Scale bars, 20 µm. Results are shown as
means ± s.e.m. Asterisk, P < 0.05; three asterisks, P < 0.005.
Figure 4 | PAR1 activation increases vessel density in the small intestine.
a, Relative levels of mRNA for PAR1 and PAR2 in segments of small intestine
from GF and CONV-R mice (n = 7 or 8 mice per group). b, PECAM-1 staining
(red) of sections of small intestine from wild-type (WT), F2r−/− and F2rl1−/−
mice. Nuclei were stained with Hoechst nuclear dye (blue). c, Quantification of
b (n = 6-9 mice per group). d, e, Relative levels of mRNA for PECAM-1 (d) and
Ang-1 (e) in segments of small intestine from wild-type, F2r−/− and F2rl1−/−
mice (n = 6-9 mice per group). f, g, Anti-TF and anti-phospho-TF
immunoblots of small-intestinal lysates from WT, F2r−/− and F2rl1−/− mice
(f) and CONV-D mice treated with PBS (control) or hirudin (1 mg/mouse)


counterparts (Fig. 4a). PAR1 is abundantly expressed in endothelial
cells, but we found that PAR1 was also expressed in enterocytes and at
higher levels in cells from CONV-R mice than in GF counterparts
(Supplementary Fig. 15). PECAM-1 staining (Fig. 4b, c) as well as
levels of mRNA for PECAM-1 and Ang-1 (Fig. 4d, e) were decreased
in intestinal tissue from PAR1-deficient (F2r−/−) but not PAR2-deficient
(F2rl1−/−) mice, which is in agreement with a study showing
that thrombin induces PAR1-dependent Ang-1 expression in
endothelial cells. Together, these data show that the microbiota
induces increased expression of PAR1, and that PAR1 has a role in
remodelling the vasculature in the small intestine.

We next investigated the potential interrelation between PAR1 and
TF in intestinal tissue. Phosphorylation of TF was decreased in lysates
of small intestine from F2r−/− in comparison with that from wild-type
and F2rl1−/− mice (Fig. 4f), indicating that PAR1 acts upstream of TF
phosphorylation. We blocked thrombin and thrombin-dependent
PAR1 signalling with hirudin immediately before and during colonization
of GF mice for 6 h, and observed a striking decrease in TF
phosphorylation in lysates of small intestine (Fig. 4g, h). We also
showed that thrombin increased the phosphorylation of TF in primary
enterocytes (Fig. 4i, j). Taken together, these data suggest that functional,
procoagulant TF is required for the generation of thrombin,
which in turn activates PAR1 to promote phosphorylation of the
cytoplasmic domain of TF in enterocytes.

This study has uncovered a novel connection between TF, PAR1
and Ang-1 in modulating vascular remodelling after colonization. Our 


4 | NATURE | VOL 000 | 00 MONTH 2012 




immediately before colonization and at 2 h and 4 h after colonization
(g). h, Quantification of the phospho-TF band shown in g (n = 6 or 7 mice per
group). i, Anti-TF and anti-phospho-TF immunoblots of primary enterocytes
(from CONV-R mice) incubated for 2 h with human thrombin (50 nmol l⁻¹).
j, Quantification of the phospho-TF band shown in i (n = 8 mice per group).
Female Swiss Webster mice were analysed in a and g-j. Female WT, F2r−/− and
F2rl1−/− mice on a C57BL/6J genetic background were used in b-f. Scale bars,
20 µm. Results are shown as means ± s.e.m. Asterisk, P < 0.05; three asterisks,
P < 0.005; n.s., not significant.


results support a model in which the microbiota induces increased 
glycosylation and surface localization of TF in the small intestine, 
leading to activation of coagulation, PAR1-dependent phosphorylation
of the TF cytoplasmic domain, and TF cytoplasmic domain
signalling linked to Ang-1-dependent vascular remodelling (Supplementary
Fig. 16). This pathway is distinct from established models
of ocular angiogenesis or tumour-induced neovascularization, which
requires the TF-factor VIIa-PAR2-mediated induction of pro-angiogenic
chemokines. We therefore suggest that TF may support
distinct pro-angiogenic pathways in different tissues. Increased
vascularization of the villi of the small intestine increases oxygenation
of the villi, which are shortened and widened after colonization.
This process may promote increased nutrient absorption, which has
been associated with increased adiposity in CONV-R mice. Further
dissection of how TF and PAR1 mediate postnatal microbiota-induced
angiogenesis may provide new therapeutic targets for improving 
intestinal homeostasis and modulating the absorptive capacity of 
the gut. 


METHODS SUMMARY 


Mice. GF Swiss Webster female mice were maintained in flexible film isolators 
under a 12-h light cycle and fed with an autoclaved chow diet (Labdiet, St Louis) 
ad libitum. CONV-R Swiss Webster female mice were transferred into identical 
isolators at weaning. Age-matched female ΔCT (ref. 22), F2r−/− (ref. 26) and
F2rl1−/− (ref. 27) mice and wild-type controls on a C57BL/6J background, low-TF
and TF knock-in mice on a C57BL/6 background, and transgenic CR2-tox176
and non-transgenic littermates on a FVB/N genetic background were also used.
Mice were killed at 10-14 weeks of age by cervical dislocation or overdose
anaesthesia; small intestines were removed and divided into eight equal segments.
The fifth segment was used unless otherwise stated. Animal protocols were 
approved by the Research Animal Ethics Committee in Gothenburg and the 
Scripps Research Institute Institutional Animal Care and Use Committee 
(IACUC). 

Isolation of primary enterocytes. Enterocytes were isolated from the fifth segment
of small intestine as described previously. The cells were cultured for 2 h in
DMEM medium before the beginning of the experiment.

Administration of TF antibody. Rabbit anti-mouse TF antibody or rabbit
anti-mouse IgG (Sigma) (1.33 mg per kg body weight) were administered
intraperitoneally to GF mice before conventionalization with a caecal microbiota 
from a CONV-R donor. Additional antibody injections were given 4 and 9 days 
after colonization. The mice were killed 14 days after colonization. 

Statistics. Data were analysed with Student’s t-test for two sample groups or one- 
way analysis of variance for three sample groups. 
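
As an illustration of the two tests named above, the following minimal Python sketch computes the mean and s.e.m. of a group, a two-sample Student's t statistic (pooled variance) and a one-way ANOVA F statistic. It is a sketch only, not the authors' analysis script: the function names and the readings are hypothetical, and P values are not computed here because they require the t and F distributions.

```python
from math import sqrt
from statistics import mean, variance


def sem(values):
    """Standard error of the mean: sample s.d. divided by sqrt(n)."""
    return sqrt(variance(values) / len(values))


def student_t(a, b):
    """Two-sample Student's t statistic with pooled variance; returns (t, df)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))
    return t, df


def one_way_anova(*groups):
    """One-way ANOVA; returns (F, df_between, df_within)."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between, df_within = k - 1, n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical readings (arbitrary units), not data from this study.
gf = [1.0, 2.0, 3.0]
conv_r = [2.0, 3.0, 4.0]
conv_d = [3.0, 4.0, 5.0]

t_stat, t_df = student_t(gf, conv_r)                      # two groups
f_stat, df_between, df_within = one_way_anova(gf, conv_r, conv_d)  # three groups
print(f"GF mean = {mean(gf):.2f}, s.e.m. = {sem(gf):.3f}")
print(f"t = {t_stat:.3f} (df = {t_df}); F = {f_stat:.3f} (df = {df_between}, {df_within})")
```

The statistics would then be compared with the appropriate t or F distribution to obtain the P values reported in the figure legends.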


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Administration of mL4-3. mL4-3 (ref. 30) (2.32 mg per kg body weight) was
administered subcutaneously to GF mice before conventionalization with a caecal 
microbiota from a CONV-R donor. Additional injections of mL4-3 were given 
three times a week. The mice were killed 14 days after colonization. 
Preparation of intestinal samples. For immunohistochemistry and in situ 
hybridization, the small intestine (divided into eight equal segments) and colon 
were flushed with PBS after excision and opened longitudinally. The tissue was 
fixed overnight in 4% formaldehyde at 4°C, washed three times in PBS, and 
incubated in 10% sucrose in PBS at 4°C. After 3h, the buffer was replaced with 
20% sucrose and 10% glycerol in PBS, and the tissue was incubated at 4 °C 
overnight. Tissues were dried with a paper towel and mounted in OCT on solid 
CO₂. Frozen sections 6 µm thick were prepared.

For mRNA analyses, the segments were frozen immediately at −80 °C in liquid
nitrogen. For immunoblots, the fifth segment was flash-frozen and homogenized 
for 10 min in lysis buffer (50 mM Tris-HCl pH 8, 150 mM NaCl, 5 mM EDTA, 1% 
Triton X-100) containing Roche Complete protease and PhosStop phosphatase 
inhibitors (diluted 1:10). The homogenate was incubated for 30 min on ice and 
centrifuged three times at 9,000g for 10 min to remove insoluble cell debris. 
Immunohistochemistry. Sections were incubated for 20 min at 22 °C and blocked
for 1 h with diluted TBST (50 mM Tris-HCl pH 7.5, 150 mM NaCl, 0.1% Triton
X-100) containing 5% rabbit serum. The blocking solution was removed and the
following primary antibodies, diluted in the same blocking solution, were added:
rat anti-mouse PECAM-1 (dilution 1:300; BD, Franklin Lakes), chicken anti-cytokeratin 8
(dilution 1:100; Abcam, Cambridge), goat anti-DLL4 (dilution
1:50; R&D), rabbit anti-mouse TF (1 µg ml⁻¹) and rabbit anti-PAR1 (dilution
1:300; Sigma). The samples were incubated overnight at 4 °C, washed three times
for 5 min in TBST and incubated for 1 h with secondary antibodies (Invitrogen,
Carlsbad) at room temperature (rabbit anti-rat Alexa594, dilution 1:800; goat anti-rabbit
IgG Alexa488, dilution 1:5,000; goat anti-chicken IgG Alexa488, dilution
1:2,000; all from BD). Nuclei were stained with Hoechst dye (3 µg ml⁻¹; Sigma)
and the sections were washed three times for 10 min in TBST. For detection of
Paneth cells, fluorescein isothiocyanate-isolectin (10 µg ml⁻¹; Sigma) was used.
Slides were mounted, and viewed at ×20 and ×40 magnification with a fluorescence
microscope (Axioplan 2 imaging; Zeiss, Oberkochen). Biopix iQ software
(http://www.biopix.se) was used to quantify PECAM-1 staining in 2-11 villi per
mouse. Confocal images and three-dimensional reconstructions were obtained 
with a Leica TCS SP5 confocal microscope (Leica, Wetzlar). 

Quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) 
analysis. Total RNA was isolated from small-intestinal tissues and isolated primary 
enterocytes with the RNeasy kit (Qiagen, Hilden). Total RNA (0.5 µg) was reverse
transcribed (High Capacity cDNA Reverse Transcription kit; Applied Biosystems,
Foster City) and SYBR green-based qRT-PCR was performed as described previously
(ref. 31). Primers are listed in Supplementary Table 1.

In situ hybridization. Mouse TF cDNA (ref. 32) was subcloned into pSPT19 for subsequent
in vitro RNA synthesis. Non-radioactive, digoxigenin-labelled sense and
antisense RNA probes were synthesized with the DIG RNA Labelling Kit (SP6/T7;
Roche, Mannheim). Tissues were pretreated for 2 min with proteinase K (10 µg ml⁻¹)
in 50 mM Tris-HCl pH 7.5, 5 mM EDTA; the reaction was stopped by washing for
30 s in 0.2% glycine in PBS, followed by two additional washing steps in PBS. Tissues
were fixed for 15 min in 4% paraformaldehyde in PBS, and washed in PBS for 2 min.
Hybridization solution was added, and tissues were pre-hybridized for 1 h at 65 °C.
RNA probe (8 ng µl⁻¹ hybridization solution) was added and preheated for 5 min at
80 °C; 100 µl was added to each slide and incubated overnight at 65 °C in a humidified
box. Slides were washed three times for 30 min in a preheated washing solution at
65 °C and twice for 30 min in MABT (100 mM maleic acid pH 7.5, 150 mM NaCl,
0.1% Tween 20) at room temperature. Slides were blocked with 2% blocking reagent
(Roche, Mannheim), 20% heat-inactivated sheep serum (Sigma) in MABT for 1 h at
room temperature. Binding of the RNA template was detected with alkaline- 
phosphatase-conjugated Fab fragments (Roche, Mannheim) and BM Purple. 


Factor Xa activity. Factor Xa activity was measured in small-intestinal lysates as 
described previously.

Measurement of TAT complexes. The TAT ELISA Kit (Uscnlife, Guangguguoji) 
was used for determination of the concentration of mouse TAT complexes in 
lysates of small-intestinal tissue. 

Immunoprecipitation. Tissue lysates were incubated for 1 h with anti-mouse TF
antibody (70 µg ml⁻¹; American Diagnostica, Stamford) or anti-integrin β1 antibody
(dilution 1:100; Cell Signaling, Danvers), and immunocomplexes were precipitated
by adding 50 µl of Protein A-Sepharose fast flow 4B (Sigma). TF and integrin β1
antigen were detected as described below.

Glycosidase treatment. Anti-TF precipitates were boiled for 5 min to release the
captured antigen from the antibody. Samples were cooled to 4 °C, and 20 U ml⁻¹
peptide N-glycosidase F (Sigma) was added for 90 min at 37 °C and then boiled
again for 5 min to inactivate the glycosidase. Treatment with O-glycosidase
(25 mU ml⁻¹; Merck, Darmstadt) was performed for 3 h at 37 °C.
Immunoblotting. Tissue lysates or immunoprecipitates were separated by using a
NuPAGE system with MOPS buffer and 10% BisTris gels. Proteins were transferred
to poly(vinylidene difluoride) membranes (Invitrogen, Carlsbad). The
membrane was blocked in 5% milk powder (in PBS/Tween) and incubated for
1.5 h in 5% milk powder containing the primary antibody (rabbit anti-mouse TF
(2.5 µg ml⁻¹; American Diagnostica, Stamford) for immunoprecipitation, rabbit
anti-mouse TF and rabbit anti-mouse phospho-TF (2 µg ml⁻¹) for specificity
controls (see Supplementary Fig. 17), rabbit anti-integrin β1 (dilution 1:1,000;
Cell Signaling), rabbit anti-actin (dilution 1:200; Sigma), rabbit anti-phospho-Tie-2
(dilution 1:250; R&D) and rabbit anti-Tie-2 antibody (dilution 1:250;
Abcam)). Secondary goat anti-rabbit IgG (horseradish peroxidase-conjugated;
Santa Cruz Biotechnology, Santa Cruz) was applied for 1 h. Alternatively, the
membrane was incubated with horseradish peroxidase-conjugated concanavalin
A (Sigma) to detect sugar moieties after immunoprecipitation. In that case, the membrane
was first washed for 2 min with PBS and incubated overnight with the lectin solution
(PBS containing Mg²⁺ and Ca²⁺). The next day, the blot was rinsed three times
with PBS/Tween. Blots were developed with enhanced chemiluminescence solutions
(Amersham Biosciences, Little Chalfont). For densitometric analysis of
protein bands, the software Multi Gauge V3.0 (Fuji Film, Tokyo) was applied.
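
The legend of Fig. 2 states that densitometric data are normalized to actin and expressed relative to GF. A minimal Python sketch of that arithmetic is given below, assuming one target-band and one actin-band intensity per sample. The function names and intensities are hypothetical and are not output of the Multi Gauge software.

```python
from statistics import mean


def normalize_to_loading_control(target, loading):
    """Divide each target-band intensity by the loading-control band of the same lane."""
    if len(target) != len(loading):
        raise ValueError("each lane needs one target and one loading-control value")
    return [t / l for t, l in zip(target, loading)]


def relative_to_reference(values, reference_values):
    """Express values relative to the mean of the reference group (here GF)."""
    reference_mean = mean(reference_values)
    return [v / reference_mean for v in values]


# Hypothetical band intensities (arbitrary units), not data from this study.
gf_tf, gf_actin = [10.0, 20.0], [100.0, 100.0]
conv_r_tf, conv_r_actin = [40.0, 60.0], [100.0, 200.0]

gf_ratio = normalize_to_loading_control(gf_tf, gf_actin)
conv_r_ratio = normalize_to_loading_control(conv_r_tf, conv_r_actin)

gf_relative = relative_to_reference(gf_ratio, gf_ratio)          # group mean is 1 by construction
conv_r_relative = relative_to_reference(conv_r_ratio, gf_ratio)  # fold change over GF
print(gf_relative, conv_r_relative)
```

Dividing by the GF group mean, rather than by each GF sample, keeps the spread of the GF group visible in the plotted means ± s.e.m.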
Cell-surface labelling and pull-down with N-hydroxysuccinimido-biotin. For 
amine-reactive biotinylation and isolation of cell surface proteins from isolated 
primary enterocytes, the Cell Surface Protein Isolation Kit (Pierce, Rockford) was 
used. Isolated proteins were separated on a 10% BisTris gel (Invitrogen), and TF 
antigen was analysed by immunoblotting. 

Proximity ligation assay (ref. 33). Slides with adhering primary enterocytes were blocked
and incubated with primary antibodies (monoclonal rat anti-mouse TF (1H1; ref. 34),
23.4 µg ml⁻¹, provided by Daniel Kirchhofer; rabbit polyclonal anti-integrin β1,
dilution 1:50; Cell Signaling Technology). Secondary antibodies (anti-rat and
anti-rabbit) conjugated with unique DNA probes (Olink Bioscience, Uppsala)
were added. Slides were evaluated with a Leica TCS SP5 confocal microscope. If
TF and integrin β1 antigens are closer than 30 nm, a fluorescence signal can be
generated.


30. Falcon, B. L. et al. Contrasting actions of selective inhibitors of angiopoietin-1 and 
angiopoietin-2 on the normalization of tumor blood vessels. Am. J. Pathol. 175, 
2159-2170 (2009). 

31. Stappenbeck, T. S. et al. Laser capture microdissection of mouse intestine:
characterizing mRNA and protein expression, and profiling intermediary
metabolism in specified cell populations. Methods Enzymol. 356, 167-196
(2002).

32. Petersen, L. C. et al. Characterization of recombinant murine factor VIIa and
recombinant murine tissue factor: a human-murine species compatibility study.
Thromb. Res. 116, 75-85 (2005).

33. Soderberg, O. et al. Direct observation of individual endogenous protein
complexes in situ by proximity ligation. Nature Methods 3, 995-1000 (2006).

34. Kirchhofer, D., Moran, P., Bullens, S. & Peale, F. A monoclonal antibody that inhibits
mouse tissue factor function. J. Thromb. Haemost. 3, 1098-1099 (2005).


©2012 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature10894 


The Shigella flexneri effector OspI deamidates 
UBC13 to dampen the inflammatory response 


Takahito Sanada, Minsoo Kim, Hitomi Mimuro, Masato Suzuki, Michinaga Ogawa, Akiho Oyama, Hiroshi Ashida,
Many bacterial pathogens can enter various host cells and then
survive intracellularly, transiently evade humoral immunity, and 
further disseminate to other cells and tissues. When bacteria enter 
host cells and replicate intracellularly, the host cells sense the invad- 
ing bacteria as damage-associated molecular patterns (DAMPs) 
and pathogen-associated molecular patterns (PAMPs) by way of 
various pattern recognition receptors. As a result, the host cells 
induce alarm signals that activate the innate immune system.
Therefore, bacteria must modulate host inflammatory signalling
and dampen these alarm signals. How pathogens do this after
invading epithelial cells remains unclear, however. Here we show that 
OspI, a Shigella flexneri effector encoded by ORF169b on the large 
plasmid and delivered by the type III secretion system, dampens 
acute inflammatory responses during bacterial invasion by suppres- 
sing the tumour-necrosis factor (TNF)-receptor-associated factor 
6 (TRAF6)-mediated signalling pathway. OspI is a glutamine 
deamidase that selectively deamidates the glutamine residue at 
position 100 in UBC13 to a glutamic acid residue. Consequently, the 
E2 ubiquitin-conjugating activity required for TRAF6 activation is 
inhibited, allowing S. flexneri OspI to modulate the diacylglycerol-CBM
(CARD-BCL10-MALT1) complex-TRAF6-nuclear-factor-κB
signalling pathway. We determined the 2.0 Å crystal structure
of OspI, which contains a putative cysteine-histidine-aspartic acid 
catalytic triad. A mutational analysis showed this catalytic triad to 
be essential for the deamidation of UBC13. Our results suggest that 
S. flexneri inhibits acute inflammatory responses in the initial stage 
of infection by targeting the UBC13-TRAF6 complex. 

The rupture of host cell membranes by invasive bacteria such as 
S. flexneri and Listeria monocytogenes is sensed as DAMPs and triggers 
acute inflammatory responses by activating various cell signals.
Within the cytoplasm, S. flexneri multiplies and subsequently spreads
to neighbouring cells and, during this dissemination, the bacteria release
peptidoglycan, lipopolysaccharide and flagellin. These bacterial components
are recognized as PAMPs by cytoplasmic pattern recognition
receptors, such as NOD-like receptors (NLRs), which induce nuclear
factor-κB (NF-κB)-mediated and mitogen-activated protein kinase
(MAPK)-mediated inflammatory responses. To counteract this host
defence, bacteria have evolved mechanisms to modulate host inflammatory
responses, through delivering several effectors that modulate
inflammatory signalling by way of the type III secretion system
(T3SS). To better understand these mechanisms, we searched for
additional S. flexneri effectors that modulate acute inflammatory responses
to bacterial invasion, and we found that OspI has a pivotal role
(Supplementary Fig. 1). 

To assess the role of OspI, we carried out a comprehensive microarray
analysis of HeLa cells infected with each of three S. flexneri strains:
YSH6000 (wild type), ΔospI (ospI-deficient) or S325 (T3SS-deficient).
We found that the messenger RNA levels of chemokines (for example,
interleukin-8 (IL-8), CC-chemokine ligand 20 (CCL20), CXC-chemokine
ligand 2 (CXCL2) and CCL2) and cytokines (tumour-necrosis
factor-α (TNF-α) and IL-6) were greatly increased in ΔospI-infected
cells at 60 min after infection (Supplementary Fig. 2a). This elevated
chemokine and cytokine production was detected as early as 30 min after
infection (Fig. 1a and Supplementary Fig. 2b). Increased phosphorylation
of inhibitor of NF-κB (IκBα) was detected in ΔospI-infected HeLa
cells relative to YSH6000-infected HeLa cells as early as 10 min after
infection (Fig. 1b). The nuclear translocation of NF-κB (p65 subunit)
was fourfold higher in cells infected with ΔospI compared with YSH6000
at 20 min after infection (Fig. 1c and Supplementary Fig. 3), suggesting
that OspI can dampen the acute inflammatory response.

We next tested whether OspI inhibits NF-κB activation when
S. flexneri infects cells, and we found that ectopic OspI expression further
inhibited NF-κB activation on S. flexneri infection (Fig. 1d). When
S. flexneri infects epithelial cells, NOD1-RIP2 (nucleotide-binding
oligomerization domain 1-receptor-interacting serine/threonine kinase
2)-dependent and NOD1-RIP2-independent NF-κB signalling pathways
are both activated. PAMPs stimulate the dependent pathway,
whereas DAMPs stimulate the independent pathway. Therefore, we
investigated the levels of NF-κB activation in HeLa cells transiently
expressing OspI (designated HeLa/OspI cells) or a mock control.
When HeLa/OspI cells were stimulated with TNF-α, NOD1 or phorbol
myristate acetate (PMA), an examination of the levels of activated NF-κB
revealed that OspI suppressed PMA-mediated NF-κB activation but
not TNF-α- or NOD1-mediated NF-κB activation (Fig. 1d).

Because PMA is a substitute for diacylglycerol (DAG) in the activation
of the protein kinase C (PKC)-NF-κB pathway, and because
DAG in the host membrane can trigger antibacterial autophagy against
Salmonella enterica serovar Typhimurium, we examined the membrane
ruffles protruding around S. flexneri entry sites in cells expressing
PKC-C1-GFP (the PKC-C1 region fused to green fluorescent
protein (GFP) as a DAG sensor). We found that DAG accumulated
around the bacterial entry site (Supplementary Fig. 4a). Indeed, the
increased IL8 mRNA production in ΔospI-infected cells was suppressed
by treating the cells with propranolol, a DAG inhibitor
(Supplementary Fig. 4b).

The DAG-NF-κB pathway is mediated through the CBM complex,
a major regulator of NF-κB signalling in lymphoid, myeloid and
non-myeloid cells in innate and adaptive immune responses. We examined
whether OspI modulates CBM-complex-mediated NF-κB signalling
by knocking down BCL10 production with short interfering
RNA (siRNA), and we found that the IL8 mRNA levels were greatly
decreased compared with cells treated with control siRNA (Fig. 1e and
Supplementary Fig. 4c). GFP-MALT1 was recruited to the S. flexneri
entry point in HeLa/GFP-MALT1 cells because MALT1 functionally
Figure 1 | OspI inhibits DAG-mediated NF-κB activation during S. flexneri
infection. a, The TNFA and IL8 expression levels in HeLa cells at 0.5, 1 and 2 h
post infection with YSH6000, ΔospI or S325. The white bar (−) indicates no
infection. b, Phosphorylation of IκBα and JNK1/2 in HeLa cells infected with
YSH6000 or ΔospI at various times after infection, as detected with the
indicated antibodies. pIκBα, phosphorylated IκBα; pJNK1/2, phosphorylated
JNK1/2. c, Nuclear localization of p65 in HeLa cells infected with YSH6000 or
ΔospI at 20 and 40 min post infection. d, The luciferase activity of NF-κB at 90
min post infection is shown. NS, not significant. e, The IL8 expression levels at
60 min post infection in HeLa cells transfected with BCL10-directed siRNA and
infected with YSH6000 or ΔospI. Data are presented as the mean ± s.d.; n = 3;
*P < 0.05. si-luc, siRNA-luciferase.


interacts with BCL10 (Supplementary Fig. 4d). These results suggested
that S. flexneri-induced changes in the host cell membrane stimulate
the DAG-CBM-complex-NF-κB pathway and that OspI specifically
dampens this pathway.

To gain further structural and functional insight, we determined the
crystal structure of recombinant S. flexneri OspI at 2.0 Å resolution
(Supplementary Table 1; Protein Data Bank (PDB) ID 3B21). OspI has
an α/β fold with four β-strands (β1-β4), seven α-helices (α1-α7) and
one 3₁₀ helix (Fig. 2a, b).

A search of known structures in PDB revealed that OspI shares
structural homology with a cysteine protease family and is most closely
related to AvrPphB (Supplementary Fig. 5). AvrPphB is a Pseudomonas
syringae effector that is delivered by way of a T3SS and has been
classified into a superfamily of enzymes containing cysteine proteases,
acetyl transferases, deamidases, transglutaminases and ubiquitin
carboxy-terminal hydrolase. Although there is considerable divergence
in the overall folding across this superfamily, a core anti-parallel
β-sheet and an α-helix that contains the active site cysteine, which
packs against the β-strands, are conserved across this family
(Supplementary Fig. 6). A potential catalytic triad (C62, H145 and D160)
in OspI was identified based on a comparison with the active site of
AvrPphB (Fig. 2c and Supplementary Figs 5 and 6a). Although overall
amino acid sequence similarity was low between OspI and these
proteins, superimposing H145 and D160 of OspI onto H212 and
D227 of AvrPphB or other members of this superfamily revealed
remarkable similarity (Fig. 2c).

C62 of OspI, however, existed in three discrete conformations in the
crystal structure, and the Sγ position was located on the opposite side of
the active site in AvrPphB. The fractional occupancy of each conformer
was estimated to be 0.55 (conformation A), 0.35 (conformation B) and
0.1 (conformation C). The highest occupancy site of C62 appeared to
form a disulphide bond with C65 at 2.05 Å (Fig. 2c, d and Supplementary
Fig. 7). Alternative conformations of a deamidase active site
cysteine are also seen in cytotoxic necrotizing factor 1 (Cnf1).

To characterize the putative catalytic triad, we substituted C62,
H145 and D160 with serine (C62S) or alanine (C62A, H145A and
D160A), and the mutant constructs were tested for their ability to
suppress NF-κB activation. Complementing the ΔospI mutant with
plasmids carrying the ospI-C62S, ospI-H145A or ospI-D160A genes
failed to mitigate the increased IκBα phosphorylation and IL8 induction
induced by ΔospI infection (Fig. 3a, b and Supplementary Fig. 8c).
OspI-C62S, OspI-H145A and OspI-D160A lost the ability to suppress
NF-κB activity on S. flexneri infection or PMA stimulation
(Supplementary Fig. 8a, b), indicating that C62, H145 and D160 in OspI
compose the catalytic triad that suppresses NF-κB signalling.
Figure 2 | Crystal structure of S. flexneri OspI. a, Overall structure of S.
flexneri OspI. The secondary structure elements are coloured as follows:
α-helices, cyan; β-strands, dark pink; and loops, light pink. b, The amino acid
sequence of OspI. Secondary structure elements are coloured as in a. The
putative active site residues are shown in red. c, Alignments of the catalytic
cores of OspI with P. syringae AvrPphB and other members of the superfamily
to which AvrPphB belongs, using the putative catalytic triad residues. All atoms
of histidine and the main chain atoms of aspartic acid are shown as a reference.
Shown are S. flexneri OspI, AvrPphB (PDB ID 1UKF), CHBP (Cif homologue
from Burkholderia pseudomallei; PDB ID 3GQM), NAT (arylamine
N-acetyltransferase from Salmonella Typhimurium; PDB ID 1E2T), PMT
(Pasteurella multocida toxin; PDB ID 2EBF), and UCH6 (ubiquitin C-terminal
hydrolase 6; PDB ID 1VJV). C62 in OspI is represented in three alternative
conformations with the three conformers labelled A, B and C. d, A close-up
view of the putative active site of OspI. OspI is shown with a surface
representation. The putative active site residues are shown as stick models.
Hydrogen bonding is indicated by a dashed line.




[Figure 3 image. Panels: a, IL8 mRNA (mock, wild-type ospI, mutant ospI); b, immunoblots with anti-pIκBα, anti-IκBα and anti-actin; c, NF-κB reporter activity (mock, ospI, C62S) with BCL10, TRAF6, TAK1/TAB1, IKK-β or p65; d, Cxcl2 mRNA, relative expression (fold), in YSH6000- or ΔospI-infected Traf6−/− MEFs expressing WT-TRAF6, mock or TRAF6-C70A; e, TRAF6 polyubiquitin, detected with anti-TRAF6 and anti-OspI.]

Figure 3 | Involvement of the CHD triad in inhibiting TRAF6-dependent
NF-κB activation. a, b, IL8 induction (at 60 min) and IκBα phosphorylation
(at 10 min) in HeLa cells infected with ΔospI harbouring the indicated ospI
mutant plasmids. c, NF-κB luciferase reporter assay in HeLa cells transfected
with the indicated plasmid combinations. d, Traf6-deficient MEFs were
rescued with the indicated retroviruses. Cxcl2 induction in cells infected with
YSH6000 or ΔospI is shown. e, In vitro TRAF6 self-ubiquitylation assay in the
presence of wild-type OspI or OspI-C62A (at 0.08 or 0.8 μM, as indicated by the
wedges). Data are presented as the mean ± s.d.; n = 3; *, P < 0.05.


We determined which steps in the DAG-CBM-complex-NF-κB
pathway are targeted by OspI by examining NF-κB activity in HeLa/
OspI or HeLa/OspI-C62S cells that were transfected with a vector
expressing the gene encoding BCL10, TRAF6, TAK1/TAB1, IKK-β
or NF-κB (p65). OspI but not OspI-C62S suppressed NF-κB activity
when HeLa cells were transfected with the gene encoding BCL10 or
TRAF6 but not TAK1/TAB1, IKK-β or p65, suggesting that OspI
targets TRAF6 or an upstream step in the signalling pathway
(Fig. 3c). Thus, we tested rescued Traf6-deficient mouse embryonic
fibroblasts (Traf6−/− MEFs) for a possible role of OspI in modulating
TRAF6 activation during S. flexneri infection: the MEFs tested were
Traf6−/−/WT-TRAF6 (Traf6−/− MEFs expressing wild-type human
TRAF6), Traf6−/−/TRAF6-C70A (an E3-ligase-deficient mutant) and
Traf6−/−/mock. Cxcl2 levels in ΔospI-infected Traf6-deficient MEFs
were low and could be rescued by wild-type TRAF6, but not by
TRAF6-C70A or mock treatment (Fig. 3d and Supplementary Fig. 8d). This
result is consistent with OspI interference in TRAF6 activation during
S. flexneri infection.

TRAF6 is an E3 ubiquitin ligase that cooperates with ubiquitin-activating
E1 and ubiquitin-conjugating E2 enzymes (UBC13 and
UEV1A), which are required for TRAF6 self-ubiquitylation and
TRAF6-induced NF-κB activation. Therefore, we investigated the
effects of OspI and OspI-C62A on TRAF6 by using a self-ubiquitylation
assay. We found that OspI, but not OspI-C62A, dampened TRAF6
polyubiquitylation (Fig. 3e and Supplementary Fig. 9). However, OspI
did not affect the formation of E2-ubiquitin thioester intermediates
(Supplementary Fig. 10), suggesting that OspI modifies TRAF6,
UBC13, UEV1A or ubiquitin. We incubated OspI with each of these
putative targets and examined their electrophoretic mobility. We
found that the mobility of UBC13, but not of the other proteins, shifted
in an OspI-dose-dependent manner (Supplementary Fig. 11). Indeed,
OspI could interact with His-UBC13 (Supplementary Fig. 12).

We then identified two affected peptides (WSPALQIR and
DKWSPALQIR, which overlap each other) from a tryptic digest of
UBC13 by using tandem mass spectrometry, and we found that the
Q100 residue in UBC13 was deamidated to E100 by OspI (Fig. 4a and
Supplementary Fig. 13). To confirm this, we created UBC13-Q100E
and showed that this modification produced the same mobility shift as
the modification at Q100 by OspI; treatment with OspI-C62A,
OspI-H145A or OspI-D160A produced no mobility shift (Fig. 4b, c and
Supplementary Fig. 14). Endogenous UBC13 was consistently deamidated
in HeLa cells 10 min after infection with YSH6000 and ΔospI/ospI but
not ΔospI or ΔospI/ospI-C62S (Fig. 4d).
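
The mass change that underlies this assignment can be sketched numerically. Deamidation converts a glutamine residue to glutamate, which raises the monoisotopic mass of a peptide by about 0.984 Da. The snippet below is an illustrative calculation only, not the analysis pipeline used in the study; the residue masses are standard monoisotopic values and the function name `peptide_mass` is hypothetical.

```python
# Standard monoisotopic residue masses (Da) for the residues in the UBC13 peptide.
RESIDUE_MASS = {
    "W": 186.07931, "S": 87.03203, "P": 97.05276, "A": 71.03711,
    "L": 113.08406, "Q": 128.05858, "E": 129.04259, "I": 113.08406,
    "R": 156.10111,
}
WATER = 18.01056  # one water accounts for the free N and C termini


def peptide_mass(sequence: str) -> float:
    """Return the neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


unmodified = "WSPALQIR"                    # tryptic peptide containing Q100
deamidated = unmodified.replace("Q", "E")  # Q100 -> E100 after deamidation
shift = peptide_mass(deamidated) - peptide_mass(unmodified)

print(f"{unmodified}: {peptide_mass(unmodified):.4f} Da")
print(f"{deamidated}: {peptide_mass(deamidated):.4f} Da")
print(f"mass shift on deamidation: {shift:+.4f} Da")
```

A shift of roughly +1 Da on the Q100-containing peptide, together with the extra negative charge that alters native-gel mobility, is the kind of signature such an analysis looks for.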

Importantly, the cocrystal structures of UBC13 and the zinc finger of
TRAF6 indicated that Q100 of UBC13 was near the catalytic pocket but
was also located near the interface between UBC13 and the TRAF6 zinc
finger. Thus, we further characterized the ubiquitin-conjugating E2
activity of UBC13-Q100E using a ubiquitylation assay. The efficiency
of UBC13-Q100E-catalysed ubiquitin chain formation on TRAF6 was
markedly lower than that of wild-type UBC13 (Fig. 4e). In an NF-κB
reporter assay, UBC13-Q100E acted as a dominant negative because
UBC13-Q100E suppressed the NF-κB activity that was stimulated by
PMA, TRAF6 and infection, but not TNF-α, in a dose-dependent
manner (Supplementary Fig. 15). We thus concluded that OspI
deamidates Q100 in UBC13, inhibiting the TRAF6-NF-κB pathway.

Here we identified OspI as a new T3SS effector that specifically targets 
TRAF6-dependent acute inflammatory signalling during S. flexneri 
invasion of epithelial cells. OspI selectively deamidates UBC13, severely 
impairing the E2 ubiquitin ligase activity of UBC13, which is required for 
TRAF6 polyubiquitylation. Thus, S. flexneri can block acute
NF-κB-mediated inflammatory responses at the early stage of epithelial invasion
(Supplementary Fig. 1), suggesting that OspI is a unique T3SS effector
Figure 4 | OspI selectively deamidates Q100 in UBC13. a, Liquid
chromatography and tandem mass spectrometry (LC-MS/MS) spectrum of a
Q100-containing tryptic peptide in OspI-treated UBC13. b, Native PAGE and
SDS-PAGE analysis of UBC13 treated with OspI (0.08, 0.8, 8 or 80 μM) or
OspI-C62A (80 μM). c, Native PAGE analysis of UBC13 treated with OspI,
OspI-C62A, OspI-H145A or OspI-D160A. d, HeLa cells were infected with S.
flexneri/mock, ΔospI/mock, ΔospI/ospI or ΔospI/ospI-C62S. At 10 and 40 min
after infection, the cell lysates were subjected to native PAGE followed by
anti-UBC13, anti-IκBα or anti-phospho-IκBα immunoblotting. e, In vitro
ubiquitylation assay in the presence of UBC13 or UBC13-Q100E.
that dampens TRAF6-dependent inflammatory signalling in response
to bacterial invasion of epithelial cells. 


METHODS SUMMARY 


The S. flexneri strains used in this study are described in the Methods. The procedures 
for the OspI crystallization and structure determination, mass spectrometric analysis, 
microarray analysis, quantitative PCR with reverse transcription, in vitro ubiquityla- 
tion assays, in vitro deamidation assays, luciferase assays, immunostaining, 
immunoblotting, S. flexneri infection and protein expression are described in 
the Methods, together with a description of the antibodies and the materials. 
OspI, reported as ORF169b (NCBI accession number CAC05849), is encoded
by an S. flexneri virulence plasmid. The S. flexneri ospI gene was disrupted using
a λ Red recombinase-mediated recombination system as previously described.
The ΔospI strain was complemented by introducing plasmids (pWKS130)
encoding ospI-C62S-myc, ospI-H145A-myc or ospI-D160A-myc. Bacterial infections
were carried out as previously described. Briefly, HeLa cells were infected with
the various S. flexneri strains at a multiplicity of infection (m.o.i.) of 100. For the
afimbrial adhesin (Afa)-expressing S. flexneri strains, cells were infected at an
m.o.i. of 10. Twenty minutes after infection, the plates were transferred into fresh
medium containing 100 μg ml⁻¹ gentamicin, to kill extracellular bacteria.
Statistical significance was determined using Student's t-test.
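
As an illustration of the significance test named above, the following minimal sketch computes a two-sample Student's t statistic for triplicate measurements (n = 3, as in the figure legends). It assumes an unpaired, equal-variance test, which the text does not specify; the measurements and the function name `students_t` are hypothetical and are not data from the study.

```python
from math import sqrt
from statistics import mean, variance


def students_t(sample_a, sample_b):
    """Two-sample Student's t statistic with pooled (equal) variance."""
    n_a, n_b = len(sample_a), len(sample_b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a)
                  + (n_b - 1) * variance(sample_b)) / dof
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t, dof


# Hypothetical relative-expression values, three replicates per group.
wild_type = [1.0, 1.2, 0.9]
mutant = [4.1, 3.8, 4.4]

t_stat, dof = students_t(mutant, wild_type)
T_CRIT_DF4 = 2.776  # two-tailed critical value at P = 0.05 for 4 degrees of freedom
significant = abs(t_stat) > T_CRIT_DF4
print(f"t = {t_stat:.2f}, d.f. = {dof}, P < 0.05: {significant}")
```

With three replicates per group there are only four degrees of freedom, so a difference must be large relative to the spread of the replicates to reach P < 0.05.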


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Strains and plasmids. S. flexneri strain YSH6000 (ref. 30) was used as the
wild-type strain. S325 (mxiA::Tn5) was used as a T3SS-deficient negative control. The
vectors pCMV-FLAG-IκB kinase-β (IKK-β), pCMV-FLAG-transforming
growth factor-β-activated kinase 1 (TAK1), and pCMV-FLAG-TAK1-binding
protein 1 (TAB1) were from G. Takaesu. TRAF6 (ref. 33) was ligated into the
plasmids pGEX6P-1, pCMV-FLAG and pMx-puro. Human MALT1 and
human BCL10 cDNAs were ligated into the pCMV-FLAG plasmid. Site-directed
mutagenesis of ospI, UBC13 and TRAF6 was performed using a QuikChange
Site-Directed Mutagenesis Kit (Stratagene).

Antibodies and materials. Antiserum specific for OspI was obtained by
immunizing rabbits with the S. flexneri OspI peptide (189-207 amino acids,
VIHVSDQEFDHYANSSSWK) coupled to keyhole limpet haemocyanin using
m-maleimidobenzoyl-N-hydroxysuccinimide ester. Anti-S. flexneri lipopolysaccharide
was prepared as previously described. Anti-phospho-IκBα (Cell Signaling
Technology), anti-IκBα (BD Transduction Laboratories), anti-phospho-JNK1/2
(Cell Signaling Technology), anti-JNK2 (Santa Cruz Biotechnology), anti-p65
(F-6; Santa Cruz Biotechnology), anti-M2 Flag (Sigma), anti-His (Sigma), anti-actin
(Millipore), anti-UBC13 (Invitrogen) and anti-ubiquitin (1B3; MBL) antibodies,
as well as Alexa Fluor 633 phalloidin (Invitrogen), were obtained commercially
and used as primary antibodies for immunostaining and immunoblotting. TNF-α
(Peprotech), PMA (Sigma) and propranolol (Sigma) were obtained commercially.

Plasmid construction, expression and protein purification for crystallization.
To purify OspI, the corresponding DNA was subcloned into the pGEX6P-1
glutathione S-transferase (GST) fusion vector (GE Healthcare). The plasmids were
used to transform the BL21 strain of Escherichia coli, and protein expression was
induced by adding 0.1 μM isopropyl-β-D-thiogalactoside (IPTG) at 30 °C for 6 h.
Whole-cell extracts were prepared by sonicating and incubating the bacteria in
lysis buffer (20 mM Tris-HCl, pH 7.4, and 150 mM NaCl) for 30 min with rotation
at 4 °C, and then clarified by centrifugation at 27,000g for 20 min. GST fusion
proteins were purified by incubating the clarified cell extract with Glutathione
Sepharose 4B. The GST moiety was removed by cleaving with the PreScission Plus
protease (GE Healthcare) and then performing anion-exchange chromatography
(HiTrap Q FF, GE Healthcare) and gel-filtration chromatography (Superdex 75,
GE Healthcare). A selenomethionine (Se-Met) derivative of OspI was prepared by
expressing the protein in strain B834 E. coli cells that were grown in minimal
medium supplemented with L-selenomethionine (25 μg ml⁻¹) (Calbiochem).

Crystallization. Native and Se-Met OspI were concentrated to 20 mg ml⁻¹ by
ultrafiltration in 25 mM Tris-HCl, pH 7.5, and 1 mM dithiothreitol. Crystals were
grown by the hanging-drop vapour diffusion method at 288 K in drops containing
a mixture of 2 μl of protein solution and 2 μl of reservoir solution. For native and
Se-Met OspI, the reservoir solution consisted of 0.1 M MES, pH 6.5, 0.1 M sodium
acetate, 1.0 M NaCl and 24% (w/v) PEG 8000. The crystals of native and Se-Met
OspI were transferred into a cryoprotective solution containing 0.1 M MES,
pH 6.5, 0.1 M sodium acetate, 1.0 M NaCl and 35% (w/v) PEG 8000.

Data collection. X-ray diffraction data were collected at SPring-8 at 100 K in
BL44XU. Single-wavelength anomalous diffraction (SAD) data were collected at
a wavelength of 0.9789 Å. Native OspI crystal data were collected at a wavelength
of 0.9000 Å. All data were reduced using the software Denzo, Scalepack and
programs from the CCP4 package. The data-processing statistics are provided
in Supplementary Table 1.

Structure determination and refinement. The structure of OspI was determined
using the SAD method. The positions of the heavy atoms were obtained using the
software SHELXD and refined using MLPHARE. The initial SAD phases were
extended to a higher resolution with diffraction data that were collected from the
native crystal to 2.0 Å resolution, with solvent flattening using PIRATE and DM.
The initial model was constructed with the ARP/wARP program. The remaining
structural elements were built manually using the Coot program. The model was
refined to 2.0 Å resolution with the software REFMAC5 and CNS. The final OspI
model shows amino acids 21 to 212. The phasing and refinement statistics are
summarized in Supplementary Table 1. There are no residues in the disallowed
regions of the Ramachandran plot. The structure figures were generated using
CCP4mg and PyMOL.

Cell culture and retroviral expression. HeLa and 293T cells were cultured in
minimal essential medium (Sigma) and Dulbecco's modified Eagle's medium
(Sigma), respectively, containing 10% FBS. Traf6-deficient MEFs were as described
previously. Human TRAF6 and the C70A mutant were ligated into pMx-puro.
The plasmids were transfected into Plat-E packaging cells as previously
described. The resultant retrovirus was used to infect Traf6-deficient MEFs
and selected for puromycin (2 μg ml⁻¹) resistance.

Luciferase reporter assay. For the luciferase reporter assays, cells were transfected
with the reporter plasmid (pGL4.32[luc2P/NF-κB-RE/Hygro]; Promega), Renilla
luciferase constructs (phRL-TK; Promega) and various combinations of expression
plasmids, by using FuGENE6 transfection reagent (Roche). The cells were infected
with S. flexneri (m.o.i. of 20) or treated with PMA (50 nM) and TNF-α (2 ng ml⁻¹)
for 1.5 h.
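
Readings from a dual-reporter assay of this kind are commonly normalized by dividing the NF-κB-driven firefly signal by the co-transfected Renilla signal and expressing the result relative to a control sample. The sketch below shows that arithmetic under the assumption that this conventional normalization applies here; the text does not spell out the calculation, and the readings and the function name `relative_nfkb_activity` are hypothetical.

```python
def relative_nfkb_activity(firefly, renilla, control_firefly, control_renilla):
    """Firefly/Renilla ratio of a sample, expressed as fold over the control."""
    sample_ratio = firefly / renilla                  # corrects for transfection efficiency
    control_ratio = control_firefly / control_renilla
    return sample_ratio / control_ratio


# Hypothetical luminometer readings (arbitrary units); not values from the study.
fold = relative_nfkb_activity(
    firefly=52000, renilla=1300,                  # stimulated or infected cells
    control_firefly=8000, control_renilla=1600,   # unstimulated control
)
print(f"NF-kB reporter activity: {fold:.1f}-fold over control")
```

Dividing by the Renilla signal first means that wells with more transfected cells do not appear to have higher NF-κB activity.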

siRNA knockdown analysis. HeLa cells were transfected with siRNA (RNAi Co)
by reverse transfection using Lipofectamine RNAiMAX. The following siRNAs
were used: human BCL10 siRNA-1 (5′-CCUUAAGAUCACGUACUGUUU-3′
and 5′-ACAGUACGUGAUCUUAAGGG-3′); BCL10 siRNA-2
(5′-CGUACUGUUUCACGACAAUGA-3′ and 5′-AUUGUCGUGAAACAGUACGUG-3′);
and si-luc (5′-CGUACGCGGAAUACUUCGAdTdT-3′ and
5′-UCGAAGUAUUCCGCGUACGdTdT-3′).
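
Each pair of strands listed above should anneal into a duplex with short 3′ overhangs. The following sketch checks that the first 19 nucleotides of each sense strand are the reverse complement of part of its partner strand. The 19-nucleotide core length and the helper names are assumptions made for illustration, and the dTdT overhangs of si-luc are omitted because they are DNA.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return rna.translate(COMPLEMENT)[::-1]


def has_duplex_core(sense: str, antisense: str, core_len: int = 19) -> bool:
    """True if the first core_len nt of the sense strand pair with the antisense strand."""
    return sense[:core_len] in reverse_complement(antisense)


# Sequences copied from the list above (si-luc without its dTdT overhangs).
SIRNAS = {
    "BCL10 siRNA-1": ("CCUUAAGAUCACGUACUGUUU", "ACAGUACGUGAUCUUAAGGG"),
    "BCL10 siRNA-2": ("CGUACUGUUUCACGACAAUGA", "AUUGUCGUGAAACAGUACGUG"),
    "si-luc": ("CGUACGCGGAAUACUUCGA", "UCGAAGUAUUCCGCGUACG"),
}

results = {name: has_duplex_core(s, a) for name, (s, a) in SIRNAS.items()}
for name, ok in results.items():
    print(f"{name}: 19-nt duplex core {'found' if ok else 'NOT found'}")
```

A check like this is a quick way to catch transcription errors when siRNA sequences are copied from a paper into an order form.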

RNA extraction and quantitative RT-PCR analysis. Total RNA was extracted
using ISOGEN (Nippongene). First-strand cDNA was synthesized from 1 μg total
RNA with reverse transcriptase using oligo(dT) primers. Real-time PCR was
performed on cDNA samples using a LightCycler 2.0 (Roche) with the SYBR
Green system (TaKaRa). The GAPDH expression levels were evaluated as an
internal control. The following primer pairs were used:
human IL8 (5′-CTGATTTCTGCAGCTCTGTGTG-3′ and 5′-GTCCACTCTCAATCACTCTCAG-3′),
human TNFA (5′-CTTCAGACACCCTCAACCTCTT-3′ and 5′-CACATTCCTGAATCCCAGGT-3′),
human CCL20 (5′-TTGATGTCAGTGCTGCTACTCC-3′ and 5′-CCGTGTGAAGCCCACAATA-3′),
human CCL2 (5′-GCTCATAGCAGCCACCTTCATT-3′ and 5′-CAGCTTCTTTGGGACACTTGCT-3′),
human CXCL2 (5′-GGGTGGCAAAGAAAAGGAG-3′ and 5′-GTTGAGCGTCAAGACCCAGT-3′),
human IL6 (5′-GATGGCTGAAAAAGATGGATGC-3′ and 5′-CTGCAGGAACTGGATCAGGACT-3′),
human GAPDH (5′-TGCCCTCAACGACCACTTTG-3′ and 5′-TTCCTCTTGTGCTCTTGCTGGG-3′),
mouse Cxcl2 (5′-CAAGGGTTGACTTCAAGAACATCC-3′ and 5′-CCTTGAGAGTGGCTATGACTTC-3′),
mouse Gapdh (5′-GTGTCTTCACCACCATGGAG-3′ and 5′-TCGTGGTTCACACCCATCAC-3′)
and human BCL10 (5′-AGCGCGACCATCGGAGAGGT-3′ and 5′-TGTGGCCGCAGAATGGCAGG-3′).
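
Relative expression against an internal control such as GAPDH is often computed with the 2^(-ΔΔCt) method. The text states only that GAPDH served as the internal control, so the formula below is an assumption shown for illustration, with hypothetical threshold-cycle (Ct) values and a hypothetical function name; it also assumes near-100% amplification efficiency for both primer pairs.

```python
def relative_expression(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target gene by the 2^-ddCt method."""
    delta_sample = ct_target - ct_reference             # normalize sample to reference gene
    delta_control = ct_target_ctrl - ct_reference_ctrl  # normalize control to reference gene
    return 2 ** -(delta_sample - delta_control)


# Hypothetical Ct values: target gene in infected vs uninfected cells,
# each normalized to the reference gene measured in the same sample.
fold = relative_expression(
    ct_target=22.0, ct_reference=18.0,
    ct_target_ctrl=27.0, ct_reference_ctrl=18.0,
)
print(f"Target mRNA: {fold:.0f}-fold relative to uninfected cells")
```

Because each PCR cycle roughly doubles the product, a target that crosses threshold five cycles earlier (after normalization) corresponds to about a 32-fold increase.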

Purification of recombinant proteins and in vitro ubiquitylation assay. E1
(Boston Biochem) and ubiquitin (Sigma) were obtained commercially.
His-UEV1A, His-UBC13 and GST-TRAF6 were prepared as previously described.
E1 (200 ng), His-UEV1A (400 ng), His-UBC13 (500 ng) and TRAF6 (300 ng) were
incubated at 30 °C for 20 min in a 50 μl reaction mixture containing reaction buffer
(20 mM Tris-HCl, pH 7.5, 5 mM ATP, 5 mM MgCl2 and 0.1 mM dithiothreitol
(DTT)).

In vitro deamidation assay and mass spectrometric analysis. UBC13 or ubiquitin 
(12 μg) was incubated with the indicated amount of purified OspI at 30 °C for
30 min in reaction buffer (20 mM Tris-HCl, pH 7.5, 100 mM NaCl and 0.1 mM
DTT). The reaction samples were separated by native PAGE, stained with 
Coomassie Brilliant Blue, and then quantified with the program ImageJ. The protein 
bands were excised from the gel, and each gel slice was subjected to in-gel digestion 
with trypsin. The resultant tryptic peptides were injected into a nano-liquid 
chromatography system (DiNa, KYA). The eluent was mixed with the matrix 
solution, and the mixture was directly blotted onto a matrix-assisted laser 
desorption (MALDI) sample plate with a micro-fractionation system (AccuSpot, 
Shimadzu Biotech) for MALDI-time of flight (MALDI-TOF) mass spectrometry 
(MS) analysis. Overall peptide identification was performed using a MALDI-TOF/ 
TOF 4600 proteomics analyser (Applied Biosystems), followed by a database 
search with Mascot (Matrix Science). 

Immunofluorescence analysis. HeLa cells were seeded on cover slips in six-well 
plates. The cells were infected with YSH6000 (wild-type S. flexneri) (m.o.i. of 100) 
for the indicated time periods, washed with PBS and then fixed in 4% 
paraformaldehyde in PBS for 15 min. The cells were permeabilized with 0.2%
Triton X-100 in PBS for 10 min and then blocked with 1% BSA in TBS for 30 min.

Congo-red-induced secretion. Congo-red-induced secretion of type III effectors
was performed as reported previously. Briefly, S. flexneri incubated at 37 °C for
4 h was stimulated with 0.003% Congo red for 10 min at 37 °C. The supernatant
was recovered by centrifugation. The samples were separated by SDS-PAGE and 
immunoblotted with anti-MYC antibody. 

Microarray analysis. Total RNA was prepared 1 h after infection and analysed by 
a gene microarray (GeneChip Human Genome U133 Plus 2.0 Array, Affymetrix). 
The raw data were normalized and analysed using GeneSpring software. Fold 
changes in the gene expression values (relative to uninfected cells) were
log2-transformed and visualized as a heat map, indicated by the colour scale.
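
The transformation described for the heat map can be sketched as follows, assuming a base-2 logarithm, which is the usual convention for microarray fold changes. The gene labels and signal intensities are hypothetical placeholders rather than values from the array data.

```python
from math import log2


def log2_fold_change(infected: float, uninfected: float) -> float:
    """Log2 of the expression ratio relative to uninfected cells."""
    return log2(infected / uninfected)


# Hypothetical normalized signal intensities: (infected, uninfected).
signals = {
    "gene_A": (3200.0, 100.0),  # strongly induced
    "gene_B": (800.0, 100.0),   # moderately induced
    "gene_C": (500.0, 500.0),   # unchanged
}

heatmap_values = {
    gene: log2_fold_change(infected, uninfected)
    for gene, (infected, uninfected) in signals.items()
}
for gene, value in heatmap_values.items():
    print(f"{gene}: log2 fold change = {value:+.1f}")
```

On this scale an unchanged gene maps to 0 and each unit corresponds to a doubling, so induced and repressed genes sit symmetrically around the centre of the colour scale.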
Intrinsic coupling of lagging-strand
synthesis to chromatin assembly 


Duncan J. Smith¹ & Iestyn Whitehouse¹


Fifty per cent of the genome is discontinuously replicated on the lagging strand as Okazaki fragments. Eukaryotic 
Okazaki fragments remain poorly characterized and, because nucleosomes are rapidly deposited on nascent DNA, 
Okazaki fragment processing and nucleosome assembly potentially affect one another. Here we show that 
ligation-competent Okazaki fragments in Saccharomyces cerevisiae are sized according to the nucleosome repeat. 
Using deep sequencing, we demonstrate that ligation junctions preferentially occur near nucleosome midpoints 
rather than in internucleosomal linker regions. Disrupting chromatin assembly or lagging-strand polymerase 
processivity affects both the size and the distribution of Okazaki fragments, suggesting a role for nascent chromatin, 
assembled immediately after the passage of the replication fork, in the termination of Okazaki fragment synthesis. Our 
studies represent the first high-resolution analysis—to our knowledge—of eukaryotic Okazaki fragments in vivo, and 
reveal the interconnection between lagging-strand synthesis and chromatin assembly. 


During eukaryotic chromosome replication, both genetic and epigenetic 
information must be accurately duplicated. For chromatin architecture 
and modifications to be truly epigenetic—that is, heritable despite not 
being genetically encoded—complete disruption and dissociation of 
nucleosomes from the replication fork must be prevented to allow the 
rapid re-deposition of precisely located and appropriately modified 
histones on the nascent DNA. Histone chaperone complexes govern 
both nucleosome disassembly and assembly at the replication fork.
DNA replication is inherently asymmetric. Okazaki fragment synthesis
on the lagging strand necessitates the repeated production of
single-stranded DNA and polymerization in the opposite direction to
fork progression. Given the delay in lagging-strand synthesis and the
rapidity of histone deposition behind the replication fork, these two
processes may be interlinked. A coordinated series of events occurs each
time an Okazaki fragment is synthesized. Each fragment is initiated via
an RNA primer and up to 30 nucleotides of DNA, both synthesized by
DNA polymerase α (Pol α)-primase; subsequently, the clamp loading
factor RFC mediates the replacement of Pol α by the sliding clamp
PCNA and the processive polymerase Pol δ (ref. 6). Pol δ extends the
nascent DNA chain through the 5′ end of the preceding Okazaki
fragment; this strand-displacement synthesis generates a 5′ RNA or DNA
flap that is cleaved by nucleases such as the flap endonuclease Fen1
(ref. 9). Repeated cycles of extension and DNA cleavage followed by
Pol δ idling produce a nick that migrates away from the replication fork
and can be sealed by DNA ligase I. Little to no DNA synthesized by the
error-prone Pol α remains in the genome after replication, suggesting
that strand displacement generally replaces at least 30 nucleotides of
DNA, but it remains unclear how the replication machinery coordinates
the transition from Pol δ to DNA ligase. Indeed, despite their
fundamental importance in replication, comparatively little is known about
the in vivo properties of eukaryotic Okazaki fragments: although they
are widely accepted to be shorter than the 1-2 kilobases (kb) observed in
prokaryotes, a clear consensus has not emerged. Moreover, we
have little information about how nucleosome assembly may interact
with lagging-strand synthesis. A plausible explanation for the difference
in size between eukaryotic and prokaryotic Okazaki fragments is that
eukaryotic DNA replication occurs within the context of chromatin.


To investigate the relationship between lagging-strand processing 
and chromatin assembly, we have developed strategies to analyse 
Okazaki fragments purified from S. cerevisiae. We find that Okazaki 
fragments are heterogeneously sized with a repeat length correspond- 
ing to that of nucleosomes. In addition, we find that ligation junctions 
occur preferentially around known nucleosome midpoints, and that 
these relationships can be altered by interfering with chromatin 
assembly or Pol δ processivity. Our data suggest an integrated model
whereby nascent chromatin structure facilitates Okazaki fragment 
processing, which may provide a means to monitor nucleosome 
assembly behind the replication fork. 


Nucleosome-sized Okazaki fragments 

To enrich for ligation-competent Okazaki fragments, we constructed 
strains of S. cerevisiae in which DNA ligase I (CDC9) expression is
driven from a doxycycline-repressible promoter and the encoded protein
tagged with a temperature-sensitive degron. The inability to ligate
Okazaki fragments should result in the accumulation of nicked DNA
that, after purification, can be radiolabelled using DNA polymerase and
α-³²P dCTP. Ligase inactivation in an asynchronous culture resulted in
the accumulation of short heterogeneous DNA species whose abund- 
ance increased with duration of ligase inhibition (Fig. 1a). Notably, 
these fragments displayed an underlying periodicity reminiscent of 
the nucleosome repeat. Indeed, comparison of the nicked DNA to a 
nucleosome ladder indicated that a significant fraction of fragments 
seem to be sized according to the 165-bp nucleosomal repeat length 
in S. cerevisiae (Fig. 1b). 
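
The expected band positions for fragments sized by the nucleosome repeat follow directly from the 165-bp repeat length quoted above. The short sketch below lists the mono-, di- and trinucleosome sizes and assigns an arbitrary fragment length to the nearest multiple; the function names and the example lengths are illustrative assumptions, not part of the published analysis.

```python
NUCLEOSOME_REPEAT_BP = 165  # S. cerevisiae repeat length quoted in the text


def expected_ladder(n_bands, repeat=NUCLEOSOME_REPEAT_BP):
    """Sizes of fragments spanning 1..n_bands nucleosome repeats."""
    return [repeat * k for k in range(1, n_bands + 1)]


def nearest_band(fragment_nt, repeat=NUCLEOSOME_REPEAT_BP):
    """Number of nucleosome repeats closest to a given fragment length."""
    return max(1, round(fragment_nt / repeat))


print("Expected mono-, di- and trinucleosome sizes:", expected_ladder(3))
for length in (160, 340, 480):  # hypothetical fragment lengths in nucleotides
    print(f"{length} nt is closest to {nearest_band(length)} repeat(s)")
```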

To clarify that we were observing Okazaki fragments, we confirmed 
that they did not appear in a G1-arrested culture until S-phase release
(Fig. 1c). Additionally, we treated purified DNA with recombinant 
DNA ligase; this repaired the nicks and severely diminished our ability 
to label fragments (Fig. 1d), showing that our assay predominantly 
detects nicked DNA rather than species containing single-stranded 
gaps or flaps resulting from incomplete Okazaki fragment processing. 
Unrelated control experiments indicated that a 125-nucleotide species 
previously reported to be Okazaki fragments is 5S ribosomal RNA
(Supplementary Fig. 2). Thus, we demonstrate that the mono-, di- and 
Figure 1 | DNA ligase I depletion in S. cerevisiae leads to the accumulation
of Okazaki fragments sized similarly to the nucleosome repeat.
a, Transcriptional repression of DNA ligase I (CDC9) results in the accumulation
of nicked DNA. Cells carrying a doxycycline-repressible allele of the CDC9 gene
were treated with doxycycline (Dox) for the indicated time. Purified genomic
DNA was labelled using exonuclease-deficient Klenow fragment and α-³²P
dCTP and separated in a denaturing agarose gel. nt, nucleotides. b, The size of
labelled Okazaki fragments mirrors the nucleosome repeat. Okazaki fragments
(lane 2) were labelled as in Fig. 1a; nucleosomes were prepared from wild-type
(lane 3) or repressible CDC9 strains (lanes 4 and 5) by in vivo micrococcal
nuclease (MNase) digestion. The chromatin digestion patterns in lanes 4 and 5
indicate that CDC9 repression does not alter global chromatin structure.
c, Okazaki fragments accumulate during S phase. Cells were arrested in G1 using
α-factor, during which time CDC9 transcription was inhibited by the addition of
doxycycline, and degradation of the protein stimulated by activation of the
degron system using galactose and 37 °C. Okazaki fragments appear upon
release of the culture into S phase (lanes 3 and 4). d, Okazaki fragments are
bordered by ligatable nicks. Purified DNA was treated (lanes 2 and 4) or
mock-treated (lanes 1 and 3) with the indicated ligase, and then labelled as in Fig. 1a.
The inability to label fragments after ligase treatment confirms that labelled
Okazaki fragments are flanked by ligatable nicks.


trinucleosome-sized fragments observed on ligase inhibition are 
generated in S phase and bordered by ligatable nicks—the essential 
properties of ligation-competent Okazaki fragments. 


Global distribution of Okazaki fragments 


To explore further the relationship between Okazaki fragments and 
nucleosomes, we developed a deep sequencing approach to map the 
strand, position and abundance of fragments across the S. cerevisiae 
genome. To allow yeast to complete S phase in the presence of nicked 
DNA we inactivated the DNA damage checkpoint by deleting the 
RAD9 gene. Additionally, to deplete ligase activity further we deleted 
the second DNA ligase (DNL4). DNL4 or RAD9 deletion does not 
affect Okazaki fragment size (Supplementary Fig. 3). Okazaki frag- 
ments were harvested from an asynchronous culture after a ~2.5 h 
ligase inactivation. Small single-stranded fragments were purified by 
anion exchange chromatography in alkaline conditions and sequen- 
cing primers were ligated directly to each end’* (Supplementary Fig. 4). 
Importantly, this method preserves strand identity. 

After paired-end deep sequencing, we aligned the data to the 
S. cerevisiae genome and found that use of a single asynchronous 
culture was sufficient to attain complete coverage (a representative 
chromosome is shown in Fig. 2a, and a comparison of replicates in 
Supplementary Fig. 5). Sequencing reads aligning to the Watson or 
Crick strands showed a complementary distribution with strand bias 
being particularly pronounced close to replication origins'®; such 
asymmetry is expected, given that replication forks proceed bidirec- 
tionally from origins (Fig. 2b). The strong strand bias observed around 
experimentally validated replication origins mapped at high resolution 
(Fig. 2c)? demonstrates preferential sequencing of nascent lagging 
strands, unequivocally confirming that the DNA species enriched after 
ligase repression are Okazaki fragments. Detailed analysis of global 
Okazaki fragment distributions can provide mechanistic insight into 
replication-fork dynamics (S. McGuffee, D.J.S. and I.W., manuscript in 
preparation). 
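
The strand-bias comparison described above (and plotted in Fig. 2c) can be illustrated with a minimal sketch, assuming that per-bin read counts for each strand are already available. The function name `log2_strand_ratio`, the pseudocount and the toy counts are illustrative assumptions, not the authors' pipeline or data.

```python
import math


def log2_strand_ratio(watson_counts, crick_counts, pseudocount=1.0):
    """Per-bin log2(Watson / Crick); the pseudocount avoids division by zero."""
    if len(watson_counts) != len(crick_counts):
        raise ValueError("strand count lists must have the same length")
    return [
        math.log2((w + pseudocount) / (c + pseudocount))
        for w, c in zip(watson_counts, crick_counts)
    ]


# Hypothetical counts in six bins spanning an origin (three bins on each side).
watson = [63, 31, 15, 1, 3, 7]
crick = [1, 3, 7, 63, 31, 15]
ratios = log2_strand_ratio(watson, crick)
print(ratios)  # the sign of the ratio flips between the two sides of the origin
```

A change in sign across the origin is the signature of strand bias; the magnitude depends on origin efficiency and on the bin size chosen.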


Okazaki fragments terminate within nucleosomes 

The periodic size of Okazaki fragments observed in Fig. 1 led us to 
investigate how the ends of Okazaki fragments relate to nucleosome 
positions found within the yeast genome. We aligned Okazaki frag- 
ment termini to a reference list of all consensus nucleosome midpoints 
(dyads) in S. cerevisiae’. Inherent bias towards smaller fragments 
during purification, library amplification and sequencing results in size 
distributions that differ substantially between starting material and 
sequenced fragments (Supplementary Fig. 6); therefore, we randomly 
selected fragments from our sequencing data to approximate the 
mononucleosome-sized fragments we detected by electrophoresis 
(Supplementary Fig. 6). Aligning both the 5’ and 3’ ends of mature 
(Fig. 3a) mononucleosome-sized fragments against the dyads of 

Figure 2 | Sequenced Okazaki fragments show a pronounced bias towards 
the lagging strand. a, Distribution of paired-end sequencing hits mapping to 
either the Watson (blue, above the axis) or Crick (orange, below) strands across 
S. cerevisiae chromosome 10. The locations of replication origins’” are indicated 
as grey dashed lines. Data are unsmoothed. b, The anticipated distribution of 
Okazaki fragments surrounding an efficient origin. c, Sequenced Okazaki 
fragments are strongly enriched in regions predicted to be replicated as the 
lagging strand. Log2 ratio of Watson strand:Crick strand fragments across a 50- 
kb window around ARS consensus sequences (ACS) confirmed previously” to 
correspond to active origins. Data are smoothed to 200 bp. 

Figure 3 | Okazaki fragment termini are preferentially located at 
nucleosome dyads. a, Schematic indicating the mature Okazaki fragment 
termini sequenced in this study. b, Okazaki fragment termini are enriched 
around nucleosome dyads. The distribution of termini from mononucleosome- 
sized Okazaki fragments around the top 50% consensus S. cerevisiae 
nucleosome dyad locations”! is shown. Unless otherwise indicated, all analyses 
use unsmoothed data normalized to the maximum signal in the analysed range, 
and are aligned such that Okazaki fragment (OF) synthesis by Pol δ proceeds 
from left to right. c, The distribution of Okazaki fragment termini correlates 
with nucleosome occupancy” around TSSs. Data are aligned to TSSs such that 
the direction of transcription is from left to right: only Okazaki fragments 
synthesized in the same direction as transcription are analysed here; equivalent 
analyses for fragments synthesized in the opposite direction can be found in 
Supplementary Fig. 7. Data are smoothed to 5 bp. d, Okazaki fragments at 
nucleosome-depleted regions are disproportionately likely to be long. Equal 
numbers of reads were selected from the top (long), and the bottom three 
(short) length quartiles of sequenced fragments. The log2 ratio of long to short 
fragments is plotted. Data are smoothed to 5 bp. 


high-confidence nucleosomes indicated that the highest density of 
ends occurs at the dyad (Fig. 3b). Alignment of unfiltered data and 
dinucleosome-sized fragments is comparable to the mononucleosome 
size-selected subset (Supplementary Fig. 6). Therefore, rather than 
being overrepresented in the accessible internucleosomal linker 
DNA, we find that Okazaki fragment termination is most likely to 
occur at locations corresponding to nucleosome dyads. 
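
The alignment of fragment termini to nucleosome dyads can be sketched as follows, assuming termini and dyads are given as integer coordinates on one chromosome. The helper names, the 80-bp window and the toy coordinates are hypothetical. The sketch also ignores strand, whereas the analysis in Fig. 3b orients every fragment so that synthesis proceeds from left to right.

```python
from bisect import bisect_left
from collections import Counter


def nearest_dyad_offset(position, sorted_dyads):
    """Signed distance (bp) from a terminus to the nearest dyad.

    `sorted_dyads` must be a non-empty list sorted in ascending order.
    """
    i = bisect_left(sorted_dyads, position)
    # Only the dyads immediately left and right of the position can be nearest.
    candidates = sorted_dyads[max(i - 1, 0): i + 1]
    nearest = min(candidates, key=lambda d: abs(position - d))
    return position - nearest


def offset_histogram(termini, dyads, window=80):
    """Count termini at each offset from the nearest dyad, within +/- window bp."""
    sorted_dyads = sorted(dyads)
    offsets = (nearest_dyad_offset(p, sorted_dyads) for p in termini)
    return Counter(o for o in offsets if abs(o) <= window)


# Toy coordinates (hypothetical): dyads spaced 165 bp apart.
dyads = [100, 265, 430]
termini = [98, 100, 103, 262, 265, 340, 431]
hist = offset_histogram(termini, dyads)
print(sorted(hist.items()))
```

A peak of the histogram at offset 0 would correspond to the enrichment at the dyad reported here; enrichment in linker DNA would instead appear as peaks near the edges of the window.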


DNA-bound proteins dissociate Pol δ 

To confirm that the observed correspondence between nucleosome 
occupancy and Okazaki fragment end density was not biased by a small 
subset of nucleosomes, we analysed transcription start sites (TSSs), at 
which nucleosome positioning follows a well-defined pattern’. 
Alignment of Okazaki fragment termini around ~4,500 TSSs indi- 
cated a marked correlation between nucleosome occupancy and the 
density of Okazaki fragment ends (Fig. 3c and Supplementary Fig. 7) 
across a broad region surrounding TSSs. Precise nucleosome position- 
ing around TSSs is determined by chromatin remodelling””’, allowing 
us to infer that the correlation refers to remodelled, rather than intrins- 
ically preferred, nucleosome locations. 

If the ends of Okazaki fragments are dictated by nucleosomes then 
nicks should always be in register with positioned nucleosomes, 
regardless of internucleosomal spacing. Thus, nucleosomes separated 
by long linker DNA should be associated with longer Okazaki frag- 
ments. We took advantage of the differential nucleosome spacing 
surrounding the nucleosome-depleted region (NDR) at TSSs, com- 
paring the distribution of equal numbers of randomly selected long 
and short fragments (defined, respectively, as the top quartile of 
sequenced fragments by length, and the bottom three quartiles). 
When the ratio of long to short fragments is plotted across this region 
(Fig. 3d and Supplementary Fig. 7), we find that fragments whose ends 
map within nucleosomes bordering the NDR are disproportionately 
long. Thus, Okazaki fragment length reflects nucleosome spacing. 
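
The long-versus-short comparison can be sketched as below, assuming each fragment is reduced to a terminus coordinate (relative to the start of the analysed window) and a length. The function names, the bin size, the pseudocount and the toy data are illustrative assumptions rather than the authors' implementation.

```python
import math
import random


def split_long_short(lengths):
    """Return (long_idx, short_idx): top length quartile vs the bottom three."""
    order = sorted(range(len(lengths)), key=lambda i: lengths[i])
    cut = (3 * len(order)) // 4
    return order[cut:], order[:cut]


def log2_long_short(positions, lengths, bin_size, n_bins, seed=0, pseudocount=1.0):
    """Per-bin log2(long / short) after sampling equal numbers from each class."""
    long_idx, short_idx = split_long_short(lengths)
    n = min(len(long_idx), len(short_idx))
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    long_idx = rng.sample(long_idx, n)
    short_idx = rng.sample(short_idx, n)

    def binned(indices):
        counts = [0] * n_bins
        for i in indices:
            b = positions[i] // bin_size
            if 0 <= b < n_bins:
                counts[b] += 1
        return counts

    long_counts, short_counts = binned(long_idx), binned(short_idx)
    return [
        math.log2((n_long + pseudocount) / (n_short + pseudocount))
        for n_long, n_short in zip(long_counts, short_counts)
    ]


# Toy data (hypothetical): the two longest fragments end in the second bin.
lengths = [100, 110, 120, 130, 140, 150, 400, 420]
positions = [5, 12, 20, 33, 41, 48, 60, 75]
ratios = log2_long_short(positions, lengths, bin_size=50, n_bins=2)
print(ratios)
```

Sampling equal numbers from each class matters because the short class holds three times as many fragments as the long class; without it, every bin would be shifted towards negative values.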

As well as correlating with nucleosome occupancy, Okazaki frag- 
ment termini were enriched around known transcription factor bind- 
ing sites (Fig. 3c)**”*. This suggested that, in addition to nucleosomes, 
sequence-specific DNA binding factors might directly influence the 
ends of Okazaki fragments. Such correlations would be expected if, 
following replication-fork passage, some transcription factors rapidly 
bound to replicated DNA and impeded Pol δ during strand-displacement 
synthesis, causing the polymerase to dissociate and leave a nick close to 
the site of collision. Furthermore, this model would also provide an 
explanation for the observed link between nucleosome occupancy 
and Okazaki fragment termination: nucleosomes contain numerous 
weak protein-DNA interactions, whose cumulative strength peaks 
around the dyad”®. Polymerases can invade nucleosomes, but experi- 
ence increasing resistance as they approach the dyad*””’. Therefore, if 
nucleosomes are rapidly re-established on the lagging strand behind the 
replication fork, and Pol δ invades them during strand-displacement 
synthesis, the likelihood of polymerase stalling and/or dissociation will 
increase with proximity to the dyad (see model in Supplementary Fig. 1), 
producing Okazaki fragments sized according to the nucleosome 
repeat. Other models that invoke biased Okazaki fragment initiation 
due to replisome pausing on encountering nucleosomes would pro- 
duce distinct correlations around NDRs (Supplementary Fig. 8). 

If polymerization by Pol δ is inhibited by sequence-specific DNA- 
bound transcription factors, Okazaki fragment ends should accumulate 
on the side of the factor that is first encountered by Pol δ. Aligning all 
transcription factor binding sites”* indicated a significant enrichment of 
precisely juxtaposed Okazaki fragment 5’ and 3’ termini around the 
replication-fork-proximal side of the binding site (Fig. 4a). However, 
when the Abf1, Reb1 and Rap1 binding sites were considered separately 
we found that, among transcription factors, these proteins most 
strongly biased the positioning of Okazaki fragment ends (Fig. 4b, c 
and Supplementary Fig. 9). Abf1, Reb1 and Rap1 have essential roles in 
chromatin organization**”’, and may therefore bind more rapidly and 
tightly to newly replicated DNA than the majority of transcription 
factors. We observed a strand-dependent enrichment of Okazaki frag- 
ment termini around the 3’ ends of tRNA genes, as well as transcription 
factor TFIIIB binding sites (Supplementary Fig. 10). Thus, it seems that 
diverse DNA binding proteins can influence the processing of Okazaki 
fragments and probably facilitate the dissociation of Pol δ, with tightly 
bound protein complexes stimulating termination more precisely than 
those, such as nascent histones, that have more diffuse binding regions. 


Chromatin assembly dictates nick location 
Two testable predictions arise from the model that Pol δ dissociates 
via interaction with nascent nucleosomes: (1) disrupting nucleosome 

Figure 4 | Transcription factors with roles in nucleosome positioning 
stimulate dissociation of Pol δ. a, Okazaki fragment termini are enriched on 
the replication-fork-proximal side of known transcription factor (TF) binding 
sites”. Mid, midpoint of TF binding site. b, c, The enrichment of Okazaki 
fragment termini around transcription factor binding sites can be attributed 
almost entirely to three factors (Abf1, Reb1 and Rap1) known to have roles in 
nucleosome positioning. 

assembly will impair nucleosome-mediated Okazaki fragment ter- 
mination and should, therefore, alter the size distribution of the frag- 
ments; and (2) reducing Pol δ processivity will result in a shift of 
Okazaki fragment termination sites away from the dyad, towards 
the replication-fork-proximal side of nucleosomes. 

The deposition of (H3H4)2 tetramers on nascent DNA (the first 
stage in nucleosome assembly) is mediated in part by CAF-1, a multi- 
subunit histone chaperone complex associated with the replisome via 
PCNA”. We deleted each subunit of CAF-1 individually, and com- 
pared the length of Okazaki fragments in the deletion strains to those 
with wild-type CAF-1 (Fig. 5a). Loss of any CAF-1 subunit abrogated 
the periodic sizing of Okazaki fragments, significantly increasing their 
average length. These data are consistent with a global delay in 
(H3H4)2 deposition following replication, with the resulting paucity 
of tetrasomes/nucleosomes on the lagging strand leading to increased 
strand displacement by Pol δ and thus to longer Okazaki fragments. 

In S. cerevisiae, Pol δ is a heterotrimer whose Pol32 subunit 
increases processivity and PCNA binding affinity’'; Pol δ lacking 
Pol32 shows decreased strand-displacement synthesis in vitro and 
probably in vivo’. Labelled Okazaki fragments from a pol32Δ strain 
were more heterogeneous but were still generally sized according to 
the nucleosome repeat (Fig. 5b). When the positions of the 5’ and 3’ 
termini of mononucleosome-sized fragments (165 ± 15 nucleotides, 
as per Fig. 3b) from the pol32Δ strain were aligned against nucleosome 
midpoints, a clear shift was observed towards the first edge of the 
nucleosome encountered by the polymerase (Fig. 5c and Supplemen- 
tary Fig. 11). Therefore, when Pol δ processivity is perturbed, the 
polymerase dissociates more rapidly due to an inability to invade nascent 
nucleosomes assembled on Okazaki fragments. The new maximum end 
density occurs ~35-40 nucleotides from the dyad, consistent with the 
extent of DNA protected by (H3H4)2 tetramers’: nucleosome assembly 
occurs step-wise, with assembly of (H3H4)2 tetramers preceding 
recruitment of H2A/H2B™; it is therefore possible that the histone- 
DNA species encountered by Pol δ is a tetrasome rather than a nucleo- 
some, although our data do not allow us to distinguish between these 
possibilities. Importantly, the distribution of Okazaki fragment termini 
around transcription factor binding sites was identical between wild- 
type and pol32Δ strains (Supplementary Fig. 9), allowing us to con- 
clude that the shift towards the edge of nucleosomes does not simply 

Figure 5 | Impaired chromatin assembly and Pol δ processivity affect the 
size of Okazaki fragments and the location of their termini, respectively. 
a, Deletion of any component of the CAF-1 complex (lanes 2-4) results in 
Okazaki fragments that are no longer sized according to nucleosomes. WT, 
wild type. b, Deletion of the Pol32 subunit of Pol δ (lane 2) does not abrogate the 
chromatin-like size distribution of Okazaki fragments. c, In the absence of 
Pol32, the termini of mononucleosome-sized Okazaki fragments are shifted 
towards the replication-fork-proximal edge of the nucleosome: peak density 
occurs at a location consistent with the predicted edges of (H3H4)2 tetramers. 
See also Supplementary Fig. 11. 

represent a constant decrease of ~35-40 bp in the extent of strand 
displacement. 


Discussion 

Our data suggest a mechanism by which Pol δ may reliably remove DNA 
synthesized by Pol α while avoiding excessive strand-displacement syn- 
thesis. Repeated cycles of extension and cleavage can occur during the 
synthesis of a single Okazaki fragment. After removal of the RNA 
primer, all DNA flap structures generated by Pol δ are biochemically 
indistinguishable from one another, and an external mechanism is 
therefore required to measure the extent of strand displacement already 
carried out by Pol δ on each fragment. The removal of Pol δ by newly 
deposited histones represents a simple way to constrain Pol δ extension, 
and might allow nucleosome assembly to stimulate replication-fork 
progression directly. Although our data imply that nucleosomes 
strongly impede Pol δ, we note that both Fen1 and DNA ligase I can 
act efficiently on nucleosomal substrates***°. We speculate that carrying 
out Okazaki fragment processing in the context of nucleosomes rather 
than DNA allows the ligation reaction to outcompete further strand 
displacement by Pol δ. 

CAF-1 is not essential in S. cerevisiae, suggesting that other 
histone chaperones such as Asf1, Rtt106 and HIR” allow sufficient 
replication-coupled or post-replicative nucleosome assembly for viab- 
ility. Nevertheless, our results are consistent with a temporal delay in 
nucleosome assembly in CAF-1 mutants, which may give rise to previ- 
ously reported silencing defects**. The importance of replication- 
coupled chromatin assembly in metazoa is illustrated by the severe 
phenotypes of CAF-1 or ASF1 disruption in human cell lines: ASF1 
depletion leads to replication-fork stalling”, and CAF-1 depletion also 
precludes progression through S phase”. 

Our studies demonstrate that the location of Okazaki fragment 
termini can be determined by interactions between the lagging-strand 
polymerase and nascent nucleosomes. This observation provides the 
first direct mechanistic evidence for the coupling of DNA replication 
to chromatin assembly on the newly replicated daughter genomes. 
Such coupling is fundamentally important given the part played by 
chromatin in the regulation of gene expression, as well as the potential 
for the epigenetic inheritance of precisely located modified nucleo- 
somes. In addition, transcription factors known to have roles in the 
establishment of chromatin structure are apparently able to re-bind to 
DNA immediately after replication-fork passage. Unlike histones, 
which are present in sufficient number to be distributed to both 
daughter genomes, DNA-bound transcription factors are present at 
only one local copy per two daughters. Regulation of transcription 
factor re-binding could thus have a key role in asymmetric epigenetic 
inheritance. 


METHODS SUMMARY 


Genomic DNA was prepared from spheroblasts essentially as described for medium 
resolution DSB mapping“, labelled using exo-Klenow fragment (NEB) and α-³²P 
dCTP and separated in 1.3% denaturing agarose gels. 

Okazaki fragments were purified by binding heat-denatured genomic DNA to 
Source 15Q (GE Healthcare) at pH 12, 300 mM NaCl, and eluting in 50 mM steps 
to 1 M. Primers for paired-end Illumina TruSeq sequencing were directly ligated 
to each end. Following second-strand synthesis and size selection by gel purifica- 
tion, libraries were amplified by PCR, purified from two sequential 2.5% native 
agarose gels, and sequenced directly. Full details are in Supplementary Fig. 4 and 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 


DNA purification. Yeast strains carrying degron-tagged, doxycycline-repressible 
alleles of CDC9 and a galactose-inducible UBR1 allele (see Supplementary Table 1 
for a list of strains) were grown at 30 °C in YEP supplemented with 2% raffinose. 
At optical density (OD) 0.4, galactose and doxycycline were added to final con- 
centrations of 2% and 40 mg l⁻¹, respectively, and the culture shaken at 37 °C for 
2.5h. Fifty-millilitre cultures were used for labelling experiments, and 250-ml 
cultures for purification and library generation. 

Genomic DNA was prepared from spheroblasts as described for medium 
resolution DSB mapping“. Following ligase repression, cells were collected by cent- 
rifugation, washed in SCE buffer (1 M sorbitol, 100 mM sodium citrate, 60 mM 
EDTA, pH 7.0) and spheroblasted for 3 min with 5 mg zymolyase 20T (USB) per 
50-ml culture. Spheroblasts were washed with SCE, and resuspended in 480 µl lysis 
buffer (50 mM Tris-HCl, pH 8.0, 50 mM EDTA, 100 mM NaCl, 1.5% sarkosyl) 
containing 150 µg proteinase K (Fisher). Digestion was carried out for 2-16 h at 
37 °C. After digestion, residual proteins and peptides were precipitated by adding 
200 µl 5 M KOAc and spinning at 16,000g for 30 min at 4 °C. Nucleic acids were 
precipitated from the supernatant by addition of 500 µl isopropanol and centrifu- 
gation at 16,000g for 10 min. Pellets were washed twice with 500 µl 70% ethanol, 
resuspended in 200 µl STE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, 100 mM 
NaCl) and digested with 25 µg RNase A (Sigma) and/or 10 U Riboshredder RNase 
blend (Epicentre) at 37 °C for 30 min. Genomic DNA was precipitated by addition 
of 20 µl NaOAc, pH 5.5 and 800 µl ethanol followed by centrifugation at 5,000g for 
10 min at room temperature (25 °C). Pellets were washed with 70% ethanol and 
resuspended in 1 µl TE (10 mM Tris-Cl pH 7.5, 0.1 mM EDTA) per ml original 
culture volume. DNA was stored at 4 °C and never frozen. 
DNA labelling. Two microlitres of DNA (corresponding to the genomic DNA 
content of 2 ml cultured cells) was used in 20 µl labelling reactions containing 5 U 
Klenow (exo-) polymerase (NEB) and α-³²P dCTP (Perkin Elmer) at a final concen- 
tration of 33 nM. Free label was removed using Illustra microspin G-50 columns 
(GE healthcare). Labelled DNA was separated in 1.3% denaturing agarose gels 
(50 mM NaOH, 1 mM EDTA). After electrophoresis, the gel was neutralized and 
DNA transferred to an uncharged nitrocellulose membrane (Hybond-N; GE 
healthcare) via capillary transfer. Membranes were exposed to phosphor screens 
or film. 


Okazaki fragment purification. Genomic DNA purified as described earlier was 
denatured by heating to 95 °C for 5 min, rapidly cooled on ice and brought to 
300 mM NaCl, pH 12. Purification was carried out in batch using 400 µl Source 
15Q (GE healthcare), binding at 300 mM NaCl, pH 12 and eluting in 50 mM steps 
to 1M NaCl, pH 12. As determined by purification of fragments pre-labelled as 
above (see also Supplementary Fig. 4), fractions from 800-900 mM NaCl con- 
tained the majority of fragments of interest. DNA was ethanol precipitated and 
treated with 10 U Riboshredder RNase blend for 30 min at 37 °C to remove residual, 
undigested RNA: digestion products were removed using Illustra microspin G-50 
columns to leave essentially pure Okazaki fragments. 
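
The stepwise elution can be laid out explicitly with a short sketch. It assumes that the first elution step is one increment above the 300 mM binding condition and that the final 1 M step is included; the text states neither explicitly. The function name `step_gradient` is hypothetical.

```python
def step_gradient(bind_mm, final_mm, step_mm):
    """NaCl concentrations (mM) of each elution step, ending at final_mm."""
    if step_mm <= 0:
        raise ValueError("step_mm must be positive")
    # Assumption: elution starts one step above the binding concentration.
    return list(range(bind_mm + step_mm, final_mm + 1, step_mm))


steps = step_gradient(300, 1000, 50)
# Fractions reported to contain most of the fragments of interest (800-900 mM).
pooled = [c for c in steps if 800 <= c <= 900]
print(len(steps), pooled)
```

Under these assumptions the gradient has 14 steps, of which three fall within the 800-900 mM range.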

Sequencing library generation. Adaptor primer pairs with single-stranded 
overhangs (shown schematically in Supplementary Fig. 4) were annealed by cooling 
from 95 °C and purified from 12% native polyacrylamide gels via standard methods. 
Sequences of the adaptor pairs are as follows. 5’ top, ACACTCTTTCCCTACACG 
ACGCTCTTCCGATCT; 5’ bottom, NNNNNNAGATCGGAAGAGCGTCGTGT 
AGGGAAAGAGTGT; 3’ top, /Phos/AGATCGGAAGAGCGGTTCAGCAGGAA 
TGCCGAG; 3’ bottom, CTCGGCATTCCTGCTGAACCGCTCTTCCGATCTIN 
NNNNN. 
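
As a consistency check on the adaptor design, the sketch below confirms that each bottom strand is the reverse complement of its top strand apart from the random (N) overhang. The sequences are joined across the line breaks in the text above. For the 3’ bottom strand only the duplex portion is used, so no assumption is made about the length of its N overhang. The helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def split_leading_overhang(seq):
    """Split a strand into (leading run of N, remainder)."""
    n = len(seq) - len(seq.lstrip("N"))
    return seq[:n], seq[n:]


FIVE_TOP = "ACACTCTTTCCCTACACGACGCTCTTCCGATCT"
FIVE_BOTTOM = "NNNNNNAGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT"
THREE_TOP = "AGATCGGAAGAGCGGTTCAGCAGGAATGCCGAG"  # carries a 5' phosphate (/Phos/)
THREE_BOTTOM_DUPLEX = "CTCGGCATTCCTGCTGAACCGCTCTTCCGATCT"  # N overhang omitted

overhang, duplex = split_leading_overhang(FIVE_BOTTOM)
five_ok = reverse_complement(FIVE_TOP) == duplex
three_ok = reverse_complement(THREE_TOP) == THREE_BOTTOM_DUPLEX
print(len(overhang), five_ok, three_ok)
```

Both pairs anneal over their full 33-nucleotide duplex, leaving the random overhang single-stranded. This is consistent with adaptors designed to be ligated directly to single-stranded fragment ends.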

Up to 200 ng denatured purified Okazaki fragments were incubated at 16 °C 
overnight in a ligation reaction containing 1 µg of each primer pair and 1,000 U 
T4 DNA ligase (NEB). Unligated adaptors were removed using Illustra microspin 
S-300 columns (GE healthcare) and a second strand-synthesis reaction carried 
out at 72°C using Taq polymerase (NEB). Products from ~200-1,000 bp were 
purified from 2.5% agarose gels run in TBE using Qiaquick kits (Qiagen). Purified 
libraries were amplified (16 cycles) using Illumina TruSeq primers according to 
Illumina protocols, except that KOD hot start polymerase (Novagen) was used. 
Amplified libraries were purified from two sequential 2.5% agarose gels. 
Subsequent steps in the sequencing workflow were carried out according to 
standard procedures. 

Nucleosome dyad and occupancy data. The top 50% of nucleosome dyads (by 
confidence score), taken from a meta-analysis”, were used in our analysis; nucleosome 
occupancy data (Fig. 3b) were from the YPD data set detailed previously’. 


42. Kaplan, N. et al. The DNA-encoded nucleosome organization of a eukaryotic 
genome. Nature 458, 362-366 (2009). 

Deciphering a neuronal circuit that mediates appetite 


Qi Wu, Michael S. Clark & Richard D. Palmiter 


Hypothalamic neurons that co-express agouti-related protein 
(AgRP), neuropeptide Y and γ-aminobutyric acid (GABA) are 
known to promote feeding and weight gain by integration of various 
nutritional, hormonal, and neuronal signals'”. Ablation of these 
neurons in mice leads to cessation of feeding that is accompanied 
by activation of Fos in most regions where they project**. Previous 
experiments have indicated that the ensuing starvation is due to 
aberrant activation of the parabrachial nucleus (PBN) and it could 
be prevented by facilitating GABA-A receptor signalling in the PBN 
within a critical adaptation period’. We speculated that loss of 
GABA signalling from AgRP-expressing neurons (AgRP neurons) 
within the PBN results in unopposed excitation of the PBN, which in 
turn inhibits feeding. However, the source of the excitatory inputs to 
the PBN was unknown. Here we show that glutamatergic neurons in 
the nucleus tractus solitarius (NTS) and caudal serotonergic neurons 
control the excitability of PBN neurons and inhibit feeding. 
Blockade of serotonin (5-HT3) receptor signalling in the NTS by 
either the chronic administration of ondansetron or the genetic 
inactivation of Tph2 in caudal serotonergic neurons that project 
to the NTS protects against starvation when AgRP neurons are 
ablated. Likewise, genetic inactivation of glutamatergic signalling 
by the NTS onto N-methyl-D-aspartate-type glutamate receptors 
in the PBN prevents starvation. We also show that suppressing 
glutamatergic output of the PBN reinstates normal appetite after 
AgRP neuron ablation, whereas it promotes weight gain without 
AgRP neuron ablation. Thus we identify the PBN as a hub that 
integrates signals from several brain regions to bidirectionally 
modulate feeding and body weight. 

Administration of diphtheria toxin (DT) to AgrpDTR/+ mice, which 
express the human DT receptor selectively in AgRP neurons, ablates 
nearly all AgRP neurons in the arcuate nucleus of the hypothalamus; 
during the next 6 days the mice gradually cease eating, lose body 
weight, and die without intervention*. Chronic infusion of bretazenil, 
a partial agonist of the GABA-A receptor, into the PBN during the DT 
treatment prevents starvation and allows an adaptive process to take 
place such that the mice eat and maintain their body weight®. Not only 
does ablation of AgRP neurons inhibit the initiation of meals, it also 
decreases the amount of liquid food that will be swallowed when it is 
delivered directly into the mouth’. Because the PBN responds to visceral 
malaise, such as food poisoning and LiCl treatment’, and also processes 
gustatory signals in paradigms such as the conditional taste aversion or 
preference”"’, we predicted that ablation of AgRP neurons results in 
unopposed activation of the PBN, which may mimic a nausea signal and 
thereby inhibit feeding. To test this hypothesis, we infused ondansetron, 
an anti-nausea drug that antagonizes 5-HT3 receptors’, subcutaneously 
or directly into the fourth ventricle, starting 3 days before injection of 
AgrpDTR/+ mice with DT. Despite the fact that the drug is administered 
orally to people, only central delivery of ondansetron prevented fatal 
weight loss and allowed the mice to recover (Fig. 1a and Supplementary 
Fig. 1a). Consumption of low-fat chow pellets by ondansetron-treated 
mice fell and they lost roughly 10% of their body weight during the first 
week after DT treatment; however, they then gradually ate more and 
regained body weight by 3 weeks after DT treatment (Fig. 1a and 
Supplementary Fig. 1a). The 5-HT3 receptor is an excitatory ion 
channel that is expressed widely in the brain, especially in the cortex 
and dorsal brainstem’’. To examine more precisely where ondansetron 
acts to prevent starvation after AgRP neuron ablation, the drug was 
delivered bilaterally to either the PBN or the NTS (see Supplementary 
Fig. 2 for cannula placement). Delivery of ondansetron to the PBN did 
not rescue the starvation phenotype of DT-treated mice, whereas 
delivery to the NTS prevented starvation (Fig. 1b and Supplemen- 
tary Fig. 1b). The results suggest that serotonin provides some of the 
excitatory drive that indirectly results in hyperactivity of the PBN after 
loss of inhibitory input from AgRP neurons. Neurons in the NTS are 
known to send excitatory, glutamatergic inputs to the PBN’*"*. We 
therefore predicted that serotonin action on 5-HT3 receptors in the 
NTS promotes hyperexcitation of the PBN, which can be measured as 
local Fos gene activation’. Consistent with this hypothesis, Fos induc- 
tion in the PBN was significantly ameliorated by the administration of 
ondansetron in the NTS (Supplementary Fig. 3). We conclude that 
inhibition of 5-HT3-mediated excitatory currents in the vicinity of 
the NTS prevents starvation after the ablation of AgRP neurons and 
promotes an adaptation that allows feeding to be maintained in the 
absence of AgRP neurons. 

Tryptophan hydroxylase 2 (Tph2) catalyses the first and rate- 
limiting step in serotonin biosynthesis in the central nervous system”. 
To examine the role of serotonin more directly, conditional Tph2lox/lox 
mice carrying the AgrpDTR/+ allele were generated and then injected 
with CAV2-Cre, a virus that is retrogradely transported from the site of 
injection to the cell bodies where it can inactivate the Tph2 gene only in 
those neurons that project to the injection site. CAV2-Cre was injected 
bilaterally into the NTS of Tph2lox/lox; AgrpDTR/+ mice, and 8 days later 
they were treated with DT to ablate the AgRP neurons. This viral 
treatment prevented the starvation that normally occurs after ablation 
of AgRP neurons. Feeding and body weight decreased slightly after DT 
treatment of the virally transduced mice, but some of the mice restored 
normal food intake and regained body weight (Fig. 1c and Supplemen- 
tary Fig. 1c). Various raphe nuclei from the virally rescued Tph2lox/lox 
mice were examined for serotonergic neurons that lacked Tph2 but 
retained L-aromatic amino acid decarboxylase (AADC), another 
marker of serotonergic cells. Many serotonergic cell bodies in the raphe 
obscurus (ROb) and raphe magnus (RMg) were found that signifi- 
cantly lacked Tph2 staining (Fig. 2a-l). Quantification of the results 
revealed that viral treatment decreased Tph2 signalling in ROb and 
RMg serotonergic neurons by 60-80% (Fig. 2m), whereas serotonergic 
neurons in the dorsal raphe (DR) were unaffected (Fig. 2m, and data 
not shown). Viral injection into the NTS of Tph2lox/lox mice decreased 
serotonin levels in the NTS by about 75% (wild-type, 11.65 ± 0.92 ng 
per mg of protein; virus-injected, 2.68 ± 0.57 ng per mg of protein). 
Our observations are consistent with known projections of the caudal 
raphe neurons to brainstem structures and projections of DR neurons 
to forebrain regions'®. 
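
The size of the reported decrease in serotonin can be checked directly from the two means given above. This is a minimal arithmetic sketch; the function name `percent_decrease` is hypothetical, and the calculation ignores the reported standard errors.

```python
def percent_decrease(control, treated):
    """Percentage decrease of `treated` relative to `control`."""
    if control <= 0:
        raise ValueError("control must be positive")
    return 100.0 * (control - treated) / control


wild_type = 11.65      # ng serotonin per mg of protein, from the text
virus_injected = 2.68  # ng serotonin per mg of protein, from the text

decrease = percent_decrease(wild_type, virus_injected)
print(f"{decrease:.1f}%")  # prints 77.0%
```

The result, roughly 77%, is in line with the "about 75%" stated in the text.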

Because many of the neurons in the NTS are known to send gluta- 
matergic projections to the PBN’*", we predicted that serotonergic 

Figure 1 | Chronic administration of ondansetron into the NTS, or genetic 
inactivation of serotonergic input to the NTS prevents starvation in AgRP 
neuron-ablated mice. a, Body weight of AgrpDTR/+ mice and wild-type (WT) 
mice after either subcutaneous (s.c.; 1 mg kg⁻¹ d⁻¹, n = 6) or fourth-ventricle 
(4v; 0.1 mg kg⁻¹ d⁻¹, n = 12) infusion of ondansetron, a 5-HT3 receptor 
antagonist, through osmotic minipumps. DT was injected intramuscularly 
twice as indicated by arrows. Minipumps were removed on day 11 after the 
pump content was depleted. Eight of 12 mice survived the treatment with DT and 
infusion of ondansetron into 4v (blue line). Asterisk, P < 0.01 between 
AgrpDTR/+ mice treated with ondansetron s.c. (n = 6 non-survivors) and 4v 
(n = 8 survivors). b, Body weight of DT-treated AgrpDTR/+ mice after chronic 
infusion of ondansetron or vehicle into either the NTS (n = 14) or the PBN 
(n = 8). DT was twice injected intramuscularly as indicated by arrows. Six of 14 
mice survived the treatment with DT and infusion of ondansetron into the NTS 
(blue line). Asterisk, P < 0.01 between AgrpDTR/+ mice infused with ondansetron 
into the NTS (n = 6 survivors) and the PBN (n = 8 non-survivors). c, Body 
weight of DT-treated, AgrpDTR/+; Tph2lox/lox mice after bilateral injection of 
CAV2-Cre virus or vehicle into the rostral part of the NTS (rNTS). Seven of 12 
mice survived the treatment with DT and viral infection (blue line). Asterisk, 
P < 0.05 between mice injected with CAV2-Cre (n = 7 survivors) and vehicle 
into the NTS (n = 8 non-survivors). Daggers indicate mice that were removed 
from the experiment when they either lost 20% of their body weight or seemed 
moribund. Results are shown as means ± s.e.m. 


activation of the NTS might lead to glutamatergic activation of the PBN, 
which could be responsible for starvation after AgRP neuron ablation. To 
test this hypothesis, we inactivated the vesicular glutamate transporter 2 
(Vglut2, encoded by the Slc17a6 gene) within the NTS by injecting AAV1-CreGFP
virus bilaterally into the NTS of Slc17a6^lox/lox; Agrp^DTR/+ mice
8 days before the initiation of DT treatment (see Supplementary Fig. 4 
for the placement of virus injections). Mice that were injected bilaterally 
and treated with DT maintained body weight and feeding, whereas 
mice that were unilaterally injected starved, as did mice with vehicle 
Figure 2 | Serotonergic projections from the ROb and RMg to the NTS
mediate starvation after ablation of AgRP neurons. a-f, Representative
immunohistochemistry pictures of Tph2 and AADC, markers of serotonergic
neurons, in Agrp^DTR/+; Tph2^lox/lox mice after bilateral injection of either vehicle
(a-c) or a retrograding CAV2-Cre virus (d-f) into the rostral NTS. Arrowheads
indicate the serotonin neurons within the ROb (B2 group) in which the
expression of Tph2 was gone after viral infection. g-l, Representative
immunostaining pictures of Tph2 and AADC at the RMg (B3 group) from the
mice described above. Arrowheads indicate the serotonin neurons within the
RMg in which the expression of Tph2 was gone after viral infection.
m, Quantified immunohistochemistry results of AADC-expressing neurons
that co-localized with Tph2-expressing neurons in the ROb, RMg and DR of
the mice described in a-f (and data not shown). Asterisk, P < 0.01, analysis of
variance (ANOVA); n = 6 mice per group. Results are shown as means and
s.e.m. Scale bar in a (for a-l), 400 µm.


injection (Fig. 3a and Supplementary Fig. 5). To further establish that 
glutamatergic activation of the PBN inhibits feeding in this model, we 
used a viral/genetic approach to reduce N-methyl-D-aspartate (NMDA) 
receptors in the PBN and thereby dampen the excitability by glutamate. 
Grin1^lox/lox; Agrp^DTR/+ mice, which carry two conditional alleles of the
gene encoding the essential NR1 subunit of NMDA receptors and the
Agrp^DTR allele, were injected bilaterally with AAV1-CreGFP in the PBN
9 days before ablation of the AgRP neurons by DT (see Supplementary 
Fig. 6 for the placement of virus injections). For the bilaterally injected 
mice, body weight declined slightly during the first 8 days after DT injec- 
tion, but then recovered along with a rebound of food intake, whereas the 
vehicle-injected and unilaterally injected mice stopped eating and did not 
recover (Fig. 3b and Supplementary Fig. 7). These experiments show 
that either decreasing glutamatergic signalling by neurons within the 
NTS or decreasing the number of NMDA receptors in the PBN protects 
against the starvation caused by the ablation of AgRP neurons. Most 
neurons within the PBN are glutamatergic. We therefore predicted that 
Figure 3 | Viral-mediated disruption of glutamatergic circuitry between the
NTS and PBN, or glutamatergic output of the PBN, rescues feeding after
ablation of AgRP neurons. a, Percentage of initial body weight of DT-treated
Agrp^DTR/+; Slc17a6^lox/lox mice after either bilateral (n = 19) or unilateral (n = 9)
injection of AAV1-CreGFP virus or vehicle into the rostral NTS (rNTS; n = 9).
AAV1-CreGFP virus decreases glutamatergic signalling from the rNTS to
downstream targets, including the PBN. Out of 19 mice, 14 survived DT
treatment and viral injections into the NTS (orange line). Asterisk, P < 0.01
between bilateral virus injection (n = 14 survivors) and unilateral virus
injection (n = 9 non-survivors) or vehicle injection (n = 9 non-survivors).
b, Percentage of initial body weight of DT-treated Agrp^DTR/+; Grin1^lox/lox mice
after either bilateral injection (n = 18) or unilateral injection of AAV1-CreGFP
virus (n = 8) or vehicle (n = 8) into the lateral PBN. AAV1-CreGFP virus
attenuates NMDA receptor signalling in the PBN, which receives dense
glutamatergic projections from the NTS. Out of 18 mice, 11 survived DT
treatment and viral injections into the PBN (orange line). Asterisk, P < 0.01


suppression of glutamatergic signalling by the PBN should also protect 
against starvation when AgRP neurons are ablated. We used the 
Slc17a6^lox/lox; Agrp^DTR/+ mice again for this experiment and injected
AAV1-CreGFP into the lateral PBN 9 days before DT treatment
(Supplementary Fig. 5). The virally injected mice lost body weight
during the first week after DT treatment but then gradually recovered,
whereas the controls did not recover (Fig. 3c and Supplementary Fig. 8).
In another cohort of mice, AAV1-CreGFP was injected into the PBN of
Slc17a6^lox/lox; Agrp^DTR/+ mice, but they were not treated with DT. Those
mice gained about 20% in body weight, accompanied by an increase of
about 16% in food intake, over the next 3 weeks (Fig. 3d and
Supplementary Fig. 9). Our results indicate that enhanced glutamatergic 
signalling by the PBN inhibits feeding and promotes weight loss, 
whereas lowering the glutamatergic output of the PBN promotes weight 
gain through an increase in feeding, possibly combined with a decrease 
in energy expenditure. 

Our studies reveal six manipulations that allow mice to survive after 
the ablation of AgRP neurons: enhancement of GABAA receptor signalling
in the PBN with bretazenil’; suppression of 5-HT3 receptor
signalling in the NTS with ondansetron; disablement of serotonergic 
input to the NTS through viral-mediated removal of Tph2; reduction 
of glutamatergic signalling by the NTS by the removal of Vglut2; 
reduction of glutamatergic activation of the PBN by decreasing the 
number of NMDA receptors; or reduction of efferent glutamatergic 


between bilateral virus injection (n = 11 survivors) and unilateral virus
injection (n = 8 non-survivors) or vehicle injection (n = 8 non-survivors).
c, Percentage of initial body weight of DT-treated Agrp^DTR/+; Slc17a6^lox/lox mice
(n = 21) or DT-treated Agrp^DTR/+; Slc17a6^lox/+ control mice (n = 10) after
bilateral injection of AAV1-CreGFP virus into the lateral PBN. AAV1-CreGFP
virus abolishes glutamatergic signalling from the PBN to various forebrain
targets. Out of 21 mice, 12 survived DT treatment and viral injections into the
PBN (blue lines). Asterisk, P < 0.01, ANOVA between 12 survivors of the
Vglut2-deficient group and the 10 non-survivor control group. d, Percentage of initial body
weight of Agrp^DTR/+; Slc17a6^lox/lox mice after bilateral injection of AAV1-CreGFP
virus (n = 14) or vehicle (n = 10) into the PBN. Asterisk, P < 0.01;
ANOVA between mice with precise viral injection (n = 7) and vehicle injection
(n = 10). Results are shown as means ± s.e.m. Daggers indicate mice that were
removed from the experiment when they either lost more than 20% of body weight
or seemed moribund.


[Figure 4 diagram labels: PBN; NTS; RMg/ROb; vagal sensory inputs; nutritional signals; anorexigenic response]

Figure 4 | Diagram illustrating circuitry that mediates loss of appetite after 
acute ablation of hypothalamic AgRP neurons. AgRP neurons co-expressing 
AgRP, neuropeptide Y (NPY) and GABA send inhibitory projections to the 
PBN. Serotonergic neurons residing in the RMg and ROb inhibit feeding 
through excitation of postsynaptic neurons in rostral NTS that express 5-HT3 
receptors (5-HT3R). This subpopulation of NTS neurons, by integration of 
visceral and gustatory inputs, sends excitatory glutamate signalling to the lateral 
PBN neurons that express NMDA receptors (NMDAR). Nutritional signalling 
from the hypothalamus and sensory signals may interact within the PBN to 
promote appropriate feeding responses. ARC, arcuate nucleus; GABAAR,
GABAA receptor.
signals from the PBN. These observations support the circuit depicted
in Fig. 4. We suggest that a subpopulation of neurons in the PBN 
integrates visceral and gustatory information from the NTS with 
energy-balance signals emanating from AgRP neurons. The NTS 
responds to vagal inputs as well as gut-derived hormones, while 
AgRP neurons detect nutrient levels and respond to hormones such 
as insulin, leptin and ghrelin'’'’. Consequently, the appetitive res- 
ponse can be modulated by food palatability and visceral condition 
in a manner dictated by current energy balance. 

Some studies have suggested that serotonin exerts its anorectic 
effects by differential actions on 5-HT1B and 5-HT2C receptors in the
hypothalamus to stimulate melanocortin signalling”, whereas a 
recent study indicated that serotonergic neurons in ventral raphe 
nuclei respond to food restriction by an elevated Fos signal”*. We show 
here that serotonin from the ROb and RMg acts on 5-HT3 receptors in 
the NTS to inhibit feeding after the ablation of AgRP neurons; thus, 
some of the anorectic effects of serotonin reuptake inhibitors, such as 
fenfluramine, may also be mediated in the brainstem. Classical mapping 
studies reveal projections from the PBN to the amygdala, thalamus, 
hypothalamus and other brain regions”. Further characterization 
and manipulation of PBN circuits that control feeding will be greatly 
facilitated by the identification of genes that are expressed exclusively 
by the relevant subpopulations of PBN neurons. Our experiments 
help define an important neural pathway within which some unique 
therapeutic targets have been characterized that could be valuable for 
the development of new treatments of various eating disorders, including
nausea and anorexia nervosa”’”®.


METHODS SUMMARY 

Agrp^DTR/+ mice, Grin1^lox/lox mice and Slc17a6^lox/lox mice on a C57Bl/6 background
were generated and identified genetically by PCR of tail DNA as described
previously*”’”’. A conditional Tph2 targeting construct was prepared by flanking the first
exon with loxP sites; the generation of Tph2^lox/lox mice on a mixed C57Bl/6 × 129/Sv
background was based on standard protocols. All experiments were performed with
male mice about 8 weeks old. To ablate AgRP neurons, mice carrying the Agrp^DTR
allele were injected twice intramuscularly with DT (50 µg kg^-1; List Biological
Laboratories, Campbell); the injections were 2 days apart*. The extent of ablation
(more than 95%) was determined by immunohistochemistry*’. Production of
adeno-associated virus 1 (AAV1-CreGFP) and canine adenovirus 2 (CAV2-Cre)
followed the protocols described previously”. For injection of virus into the NTS
or PBN, mice were anaesthetized and virus was injected bilaterally or unilaterally
through a 5-µl Hamilton syringe. For drug treatments, Alzet 14-day minipumps
(model 1002) loaded with 100 µl of ondansetron (6 mg ml^-1; Sigma-Aldrich) were
implanted subcutaneously on the back of anaesthetized mice 4 days before DT
treatment. Alternatively, cannulas (28 gauge; Plastics One) were placed into fourth
ventricles and connected by tubing with the subcutaneous minipumps that were
loaded with ondansetron (0.6 mg ml^-1). For feeding assays, mice were transferred
to BioDAQ Food and Water Intake Monitor (Research Diets) supplied with water
and low-fat chow diet (D12450B). The mice were allowed to acclimatize for 3 days
before the initiation of each experiment and data collection. Body weight and total
food intake were recorded every 24 h. Feeding and drinking activity was recorded
in accordance with the manufacturer’s suggested protocol. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Animal maintenance and neuron ablation. Mice were housed in a temperature- 
and humidity-controlled facility with a 12-h light cycle. All animal care and 
experimental procedures were approved by the Institutional Animal Care and 
Use Committee at the University of Washington. In compliance with our 
approved protocol, all experiments were terminated if the body weight of mice 
fell to 80% of their original body weight or they seemed moribund. Agrp^DTR/+
mice, Grin1^lox/lox mice and Slc17a6^lox/lox mice were generated and genetically
identified by PCR of tail DNA as described previously*’’”*. A conditional Tph2
targeting construct was prepared by flanking the first exon with loxP sites along 
with a frt-flanked SV-Neo gene in the first intron. The construct was electroporated 
in G4 ES cells, and correctly targeted clones were identified by Southern blotting. 
After removal of the SV-Neo gene by breeding with a mouse expressing FLP
recombinase, heterozygotes were bred to generate Tph2^lox/lox mice that were used
for viral injection. Details are available from the authors on request. 

All except Tph2^lox/lox mice were on the C57Bl/6 background (more than nine
generations backcrossed); Tph2^lox/lox mice were on a mixed 129/Sv × C57Bl/6
background. Agrp^DTR/DTR male mice were bred with Slc17a6^lox/lox female mice
to generate Agrp^DTR/+; Slc17a6^lox/+ mice, which were further bred to each other
to create Agrp^DTR/+; Slc17a6^lox/lox mice and Agrp^DTR/+; Slc17a6^lox/+ control mice.
A similar breeding strategy was adopted when generating Agrp^DTR/+; Tph2^lox/lox
mice and Agrp^DTR/+; Grin1^lox/lox mice. Mice were group housed with a standard
chow diet (LabDiet 5053) and water provided ad libitum until the beginning of the
experiments. All experiments were performed with male mice about 8 weeks old.
To ablate AgRP neurons, mice carrying the Agrp^DTR allele were twice injected
intramuscularly with DT (50 µg kg^-1; List Biological Laboratories, Campbell); the
injections were 2 days apart’. The extent of ablation (more than 95%) was
determined by immunohistochemistry*”.

Viral injections. Production of adeno-associated virus 1 (AAV1-CreGFP) and 
canine adenovirus 2 (CAV2-Cre) followed the protocols described previously**”°. 
For injection of CAV2-Cre or AAV1-CreGFP virus into the rostral NTS, mice were 
anaesthetized and virus (or PBS as the vehicle) was injected bilaterally or unilaterally 
(1 µl of roughly 10"° particles per µl each side) through a Hamilton syringe (size 5 µl;
Hamilton, Reno), using stereotactic coordinates ±0.8 mm (x axis), −7.1 mm (y
axis) and −4.3 mm (z axis). Similarly, AAV1-CreGFP virus (or PBS as the vehicle)
was injected into the PBN bilaterally or unilaterally, using stereotactic coordinates
±1.0 mm (x axis), −5.3 mm (y axis) and −3.3 mm (z axis). Brain samples from all
mice were collected at the end of the behavioural experiment and processed for 
immunohistological analysis. For all viral injection experiments, only a fraction 
(indicated in figure legends) of the DT-treated mice survived AgRP-neuron 
ablation; subsequent evaluation of CreGFP expression revealed that failure to 
rescue was associated with poor placement or inadequate viral transduction. 
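
The rescue fractions referred to above can be tabulated from the survivor counts given in the legends of Figs 1 and 3. This is a small Python sketch for illustration only; the dictionary keys and the helper name `survival_fraction` are our own labels, and only the four viral injection experiments are listed.

```python
# (survivors, total DT-treated mice) as stated in the figure legends.
viral_rescue_counts = {
    "CAV2-Cre into NTS, Tph2 lox/lox (Fig. 1c)": (7, 12),
    "AAV1-CreGFP into NTS, Slc17a6 lox/lox (Fig. 3a)": (14, 19),
    "AAV1-CreGFP into PBN, Grin1 lox/lox (Fig. 3b)": (11, 18),
    "AAV1-CreGFP into PBN, Slc17a6 lox/lox (Fig. 3c)": (12, 21),
}


def survival_fraction(survivors: int, total: int) -> float:
    """Fraction of treated mice that survived AgRP-neuron ablation."""
    return survivors / total


fractions = {
    name: survival_fraction(survivors, total)
    for name, (survivors, total) in viral_rescue_counts.items()
}

for name, fraction in fractions.items():
    print(f"{name}: {fraction:.0%} rescued")
```

In each experiment somewhat more than half of the bilaterally injected mice were rescued; the remainder are the animals for which placement or transduction was judged inadequate.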

Drug treatments. Alzet 14-day minipumps (model 1002; Durect, Cupertino) 
loaded with 100 µl of ondansetron (6 mg ml^-1 in saline; Sigma-Aldrich, St
Louis) were implanted subcutaneously on the back of anaesthetized mice 4 days
before DT treatment. These minipumps dispense 0.25 µl h^-1. Alternatively,
cannulas (28 gauge; Plastics One, Roanoke) were placed into fourth ventricles 
under anaesthesia and the subcutaneous minipumps, which were loaded with
ondansetron (0.6 mg ml^-1 in saline), were connected to the cannulas by tubing
(PE60; Stoelting, Wood Dale) that was threaded under the skin to help prevent the
mice from dislodging it. For some experiments, the minipumps (ondansetron,
0.6 mg ml^-1) were connected to bilateral cannulas (28 gauge; Plastics One) directed
to specific brain regions by using the following stereotactic coordinates: PBN,
±1 mm (x axis), −5.3 mm (y axis) and −3.3 mm (z axis); NTS, ±0.8 mm (x axis),
−7.1 mm (y axis) and −4.3 mm (z axis). The patency and placement of the bilateral
minipump were verified at the end of each experiment, and brain samples were 
processed for immunohistological analysis. 
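
The relation between pump rate, drug concentration and daily dose can be sketched as follows. The pump rate, reservoir volume and concentrations are taken from the text; the body weight of 25 g is an assumption introduced only for illustration (the text does not give one), so the resulting dose per kilogram is approximate.

```python
PUMP_RATE_UL_PER_H = 0.25        # dispensing rate stated in the text
RESERVOIR_UL = 100.0             # loaded volume stated in the text
ASSUMED_BODY_WEIGHT_KG = 0.025   # assumption: roughly 25 g mouse, not from the text


def daily_dose_mg_per_kg(conc_mg_per_ml: float,
                         body_weight_kg: float = ASSUMED_BODY_WEIGHT_KG) -> float:
    """Approximate daily dose delivered by the minipump, in mg per kg per day."""
    ul_per_day = PUMP_RATE_UL_PER_H * 24.0
    mg_per_day = conc_mg_per_ml * ul_per_day / 1000.0  # 1 ml = 1000 µl
    return mg_per_day / body_weight_kg


def pump_duration_days() -> float:
    """Time needed to empty the reservoir at the stated rate, in days."""
    return RESERVOIR_UL / PUMP_RATE_UL_PER_H / 24.0


print(f"6 mg/ml pump:   {daily_dose_mg_per_kg(6.0):.2f} mg/kg/day")
print(f"0.6 mg/ml pump: {daily_dose_mg_per_kg(0.6):.3f} mg/kg/day")
print(f"Reservoir lasts about {pump_duration_days():.1f} days")
```

Under this assumed body weight the two concentrations give doses of the same order as the nominal 1 and 0.1 mg kg^-1 d^-1 quoted in the Fig. 1 legend, and the tenfold ratio between them is independent of the assumption.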

Food intake and body weight measurements. For feeding assays, mice were 
transferred to BioDAQ Food and Water Intake Monitor (Research Diets, New 
Brunswick) supplied with water and low-fat (3.85 kcal ml^-1) chow diet (D12450B;
Research Diets). The mice were allowed to acclimatize for 3 days before initiation 
of each experiment and data collection. Body weight and total food intake were 
recorded every 24h. Feeding and drinking activity was recorded in accordance 
with the manufacturer’s suggested protocol. 

Immunohistochemistry. Mice were killed by CO2 asphyxiation and perfused
transcardially with ice-cold PBS buffer containing 4% paraformaldehyde. Brains
were dissected and postfixed overnight at 4 °C in the fixation buffer. Free-floating
brain sections (25 µm) were washed three times in PBS containing 0.1% Triton
X-100 (PBST buffer) solution (15 min each wash) and then blocked with 3%
normal donkey serum in PBST for 2-3 h at about 23 °C. Rabbit anti-AgRP (dilu- 
tion 1:1,500; Phoenix Pharmaceuticals, Belmont), rabbit anti-Fos (dilution 1:1,000; 
Millipore, Temecula), monoclonal anti-tryptophan hydroxylase (dilution 1:1,500; 
Sigma-Aldrich) and rabbit anti-dopa decarboxylase (equivalent to AADC; dilution 
1:500; Millipore) were applied to the sections for incubation overnight at 4°C, 
followed by three 15-min rinses in PBST. Finally, sections were incubated in Cy2- 
or Cy3-labelled secondary antibody (dilution 1:300; Jackson Immunolaboratory, 
West Grove) before visualization. Images were captured with a digital camera 
mounted on a Leica TCS SP1 confocal microscope (Leica Microsystems); all paired
photos were obtained through the same system settings. For each group of mice, at 
least eight sections from four different mice were analysed. 

Data analyses. Quantification of Tph2-positive and AADC-positive cells was 
performed with the NIH Image software (National Institutes of Health). 
Anatomical correlations of brain sections and delineation of individual nuclei 
were determined by comparing landmarks of Nissl staining images with those 
given in the stereotactic atlas. From the anatomically matched sections, a region of 
interest of the same size was further defined. Meanwhile, an optimized threshold 
that can discern round nuclei from partly stained ones as well as background 
noise was preset for all measurements. For all experiments only those mice 
with correct placement of cannula or viral injections were compared with the 
control group. Unless otherwise stated, data sets collected from all experiments 
were analysed by one-way ANOVA followed by the Student-Newman-Keuls 
method for statistical significance; results were plotted as means ± s.e.m.
Post-hoc analysis was performed when group differences were significant by ANOVA
at P<0.05. 
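
As an illustration of the one-way ANOVA step described above, the F statistic can be computed from group data as in the following Python sketch. The example values are hypothetical and are not data from this study; the Student-Newman-Keuls post-hoc comparison is not implemented here.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists, one list of measurements per group.
    """
    all_values = [x for group in groups for x in group]
    grand_mean = mean(all_values)

    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individual values around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical measurements for three groups (illustration only).
example_groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]
f_stat, df_between, df_within = one_way_anova_f(example_groups)
print(f"F({df_between}, {df_within}) = {f_stat:.2f}")
```

The P value would then be obtained from the F distribution with the returned degrees of freedom, and post-hoc comparisons run only if it falls below 0.05.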
Visualizing molecular juggling within a
B12-dependent methyltransferase complex
Derivatives of vitamin B12 are used in methyl group transfer in
biological processes as diverse as methionine synthesis in humans
and CO2 fixation in acetogenic bacteria’ *. This seemingly straightforward
reaction requires large, multimodular enzyme complexes
that adopt multiple conformations to alternately activate, protect
and perform catalysis on the reactive B12 cofactor. Crystal structures
determined thus far have provided structural information for
only fragments of these complexes* ”, inspiring speculation about
the overall protein assembly and conformational movements
inherent to activity. Here we present X-ray crystal structures of a
complete 220 kDa complex that contains all enzymes responsible
for B12-dependent methyl transfer, namely the corrinoid iron-sulphur
protein and its methyltransferase from the model acetogen
Moorella thermoacetica. These structures provide the first three-dimensional
depiction of all protein modules required for the
activation, protection and catalytic steps of B12-dependent methyl
transfer. In addition, the structures capture B12 at multiple locations
between its ‘resting’ and catalytic positions, allowing visualization of
the dramatic protein rearrangements that enable methyl transfer
and identification of the trajectory for B12 movement within the
large enzyme scaffold. The structures are also presented alongside
in crystallo spectroscopic data, which confirm enzymatic activity
within crystals and demonstrate the largest known conformational
movements of proteins in a crystalline state. Taken together, this
work provides a model for the molecular juggling that accompanies
turnover and helps explain why such an elaborate protein framework
is required for such a simple, yet biologically essential reaction.

B12-dependent methyl transfer lies at the heart of methylation
biochemistry and is an essential reaction in human health and microbial
CO2 sequestration”’. In humans, methionine synthase (MetH)
methylates homocysteine to form methionine to maintain cellular
pools of folate (vitamin B9) and S-adenosylmethionine (AdoMet),
the universal methyl donor. MetH mutation or vitamin B12 deficiency
can cause serious health consequences, including megaloblastic anaemia
and birth abnormalities such as neural tube defects'*. Acetogenic bacteria,
including M. thermoacetica, use the corrinoid iron-sulphur protein
(CFeSP) and its methyltransferase (MeTr) together to catalyse methyl
transfer in the Wood-Ljungdahl carbon fixation pathway for growth on
CO2 as the sole carbon source™.

For both MetH and CFeSP/MeTr, methyltetrahydrofolate (CH3-H4folate)
is the methyl donor, and a protein-bound B12 derivative
(cobalamin for MetH and 5’-methoxybenzimidazolyl cobamide for
CFeSP) is the methyl carrier. In acetogenic bacteria, the CH3-H4folate
methyl group is derived from enzymatic reduction of CO2,
whereas in humans, CH3-H4folate is the predominant circulating form
of the vitamin. Although CH3-H4folate is the common methyl source,
methyl removal from the N5 tertiary amine is chemically challenging
because the product, tetrahydrofolate (H4folate), is a poor leaving
group’. Therefore, a particularly powerful nucleophile is required, and
B12 with cobalt in the +1 oxidation state, a Co(I) species dubbed a
‘supernucleophile’’, is recruited. Such strong reactivity comes at a price:
reducing the inactive Co(II) state to active Co(I) is thermodynamically
challenging, as the Co(II/I) reduction potential is one of the lowest in
nature, −504 mV in CFeSP and −526 mV in MetH'””. In CFeSP, an
electron is first delivered from a partner protein to an Fe4S4 cluster
harboured by an activation domain’*””. The electron is then passed to
Co(II) to yield Co(I) (equation (1)), which attacks CH3-H4folate to form
CH3-Co(III) (equation (2)). CFeSP then delivers the methyl group to the
Ni2Fe4S4 active site metallocluster (A-cluster) of acetyl-CoA synthase
(ACS), where it becomes the methyl of acetyl-CoA, and B12 returns to its
nucleophilic Co(I) state.


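$$\text{[4Fe-4S]}^{1+} + \text{Co(II)} \rightleftharpoons \text{[4Fe-4S]}^{2+} + \text{Co(I)} \tag{1}$$

$$\text{Co(I)} + \text{CH}_3\text{-H}_4\text{folate} \rightleftharpoons \text{CH}_3\text{-Co(III)} + \text{H}_4\text{folate} \tag{2}$$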
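
To give a sense of scale for the Co(II/I) potentials quoted above, a reduction potential E can be converted to a free energy with ΔG = −nFE. The Python sketch below does this for a one-electron reduction; it assumes the quoted potentials are referenced to the standard hydrogen electrode, which the text does not state explicitly, and the function name is illustrative.

```python
FARADAY_C_PER_MOL = 96485.332  # Faraday constant, coulombs per mole of electrons


def reduction_free_energy_kj_per_mol(potential_mv: float, n_electrons: int = 1) -> float:
    """Free energy of a reduction half-reaction, dG = -n * F * E, in kJ/mol."""
    potential_v = potential_mv / 1000.0
    return -n_electrons * FARADAY_C_PER_MOL * potential_v / 1000.0


# Co(II/I) reduction potentials quoted in the text (mV).
co_potentials_mv = {"CFeSP": -504.0, "MetH": -526.0}

free_energies = {
    enzyme: reduction_free_energy_kj_per_mol(e_mv)
    for enzyme, e_mv in co_potentials_mv.items()
}

for enzyme, dg in free_energies.items():
    print(f"{enzyme}: dG = {dg:+.1f} kJ/mol for Co(II) -> Co(I)")
```

Both values are positive and close to 50 kJ/mol relative to the reference couple, which illustrates why reductive activation is described as thermodynamically challenging.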


During the catalytic cycle of both MetH and CFeSP/MeTr 
(Supplementary Fig. 1), a series of ‘molecular juggling’ acts must be 
performed in which domains rearrange to contact the B12 cofactor.
Crystal structures of a MetH B12-binding fragment* and CFeSP from
Carboxydothermus hydrogenoformans (ChCFeSP)’ both depict a ‘resting’
state, where B12 is buried by a protective ‘capping’ domain,
shielded from unwanted chemistry but inaccessible to substrate.
Because methyl transfer uses SN2 substitution”, large conformational
changes must ‘uncap’ B12 before chemistry can occur. B12 is ‘uncapped’
in structures of MetH fragments that depict B12 activation”’”"’, but no
structure has been solved that shows B12- and CH3-H4folate-binding
domains together to illustrate methyl transfer. 

To visualize this elusive methyl transfer complex, we determined 
a 2.38 Å resolution structure of folate-free CFeSP/MeTr from
M. thermoacetica (Fig. 1 and Supplementary Table 1). The homodimeric
MeTr component (58 kDa) is virtually identical to previous structures
of both MeTr*’? (Supplementary Fig. 2a), root mean squared
deviation (r.m.s.d.) for Cα atoms 0.39 Å, and the analogous MetH
domain that binds CH3-H4folate’, r.m.s.d. 1.03-1.08 Å. MeTr and
MetH both use (β/α)8 triosephosphate isomerase (TIM) barrels to
bind and activate CH3-H4folate for nucleophilic attack. Two CFeSPs
are present in the complex, each containing two subunits. The small
subunit (35 kDa) is a TIM barrel which acts as the B12 ‘cap’ in the
ChCFeSP structure’, while the large subunit (48 kDa) has three
domains joined by linkers: an amino (N)-terminal Fe4S4 activation
domain (residues 1-57), a TIM barrel domain (residues 93-312) and a
carboxy (C)-terminal B12-binding domain (residues 325-446). With
the exception of the Fe4S4 and B12 domains, discussed below, both
CFeSP copies align well to the ChCFeSP structure, r.m.s.d. 0.81-0.85 Å
(Supplementary Fig. 2b). 
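
The subunit masses given above can be summed to check the stated size of the assembly. This is a simple arithmetic sketch in Python using only the masses quoted in the text; the variable names are our own.

```python
METR_HOMODIMER_KDA = 58.0   # MeTr homodimer
CFESP_SMALL_KDA = 35.0      # CFeSP small subunit
CFESP_LARGE_KDA = 48.0      # CFeSP large subunit
CFESP_COPIES = 2            # two CFeSPs per complex

complex_mass_kda = METR_HOMODIMER_KDA + CFESP_COPIES * (CFESP_SMALL_KDA + CFESP_LARGE_KDA)
print(f"Summed mass of the CFeSP/MeTr assembly: {complex_mass_kda:.0f} kDa")
```

The sum of the rounded subunit masses agrees with the approximately 220 kDa quoted for the complete complex.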
Figure 1 | The overall CFeSP/MeTr complex. Ribbon representation of MeTr
homodimer (MeTr) in light and dark pink, CFeSP small subunits in orange,
CFeSP large subunit Fe4S4 domains in teal and cyan, TIM barrel domains in


In the 220 kDa CFeSP/MeTr assembly (Fig. 1), the MeTr homodi- 
mer lies in the centre, with one CFeSP bound on either side. Each MeTr 
monomer has a C-terminal α-helix (residues 255-262) protruding
from the TIM barrel rim. Contacts between this helix and its preceding
loop with a CFeSP small subunit helix (residues 191-204) form the
primary interactions between MeTr and CFeSP (Supplementary Fig. 3).
Weak interactions between MeTr and CFeSP Fe4S4 domains have
stabilized these highly flexible’ domains responsible for B12 activation,
allowing their visualization as bundles of short α-helices connected by
long loops that coordinate the Fe4S4 cubane (Fig. 1 and Supplementary
Figs 4 and 5). The Fe4S4 domains are observed to adopt a variety of
positions that are all too far from the B12 to afford reductive activation’.
However, the long and primarily unstructured protein linkers that
connect both the Fe4S4 and B12 domains to the central TIM barrel
must allow for the requisite flexibility for B12 activation (Supplementary
Figs 2b and 6).

B12 domains of both MetH and CFeSP adopt Rossmann-like
architectures that bind B12 in the base-off conformation (Fig. 1)*”.
High B-factors support the notion of flexibility mentioned above (Supplementary
Fig. 7 and Supplementary Table 2), where electron density
for both B12 domains represents a highest occupancy position within
an ensemble, rather than a sole conformation. In both CFeSPs, the
average B12 domain position resides between the ‘capping’ small subunit
TIM barrel and the TIM barrel of a MeTr monomer, which are
adjacent and nearly perpendicular to each other (Fig. 1). On average,
the B12 Co has shifted approximately 6.5 Å away from its ‘resting’
location towards the MeTr folate-binding site. B12 in this structure is
thus positioned ‘en route’ towards catalysis, with approximately 18 Å
remaining to the methyl group of folate modelled into the MeTr active
site, based on an alignment with the folate-bound MeTr structure’®. In
transitioning between ‘resting’ and ‘en route’ positions, the B12 corrin
ring breaks three interactions with the ‘capping’ domain and forms
new contacts, including an H-bond with Asn 203 of MeTr (Fig. 2a, b).

Given the flexibility suggested by this structural analysis, we 
explored whether the B12 domain can sample the 18 Å necessary to
afford turnover within intact crystals, using anaerobic in crystallo
ultraviolet-visible absorption spectroscopy to monitor the state of
B12. In crystallo and analogous solution spectra were collected in
green and blue, and B12 domains in dark green and dark blue. B12 cofactors in
magenta sticks with cobalt as violet spheres. Fe4S4 clusters in spheres: Fe in
orange, S in yellow.


parallel (Fig. 3) for the as-isolated Co(II) form of B12. Reduction to
Co(I) and methylation to CH3-Co(III) were then achieved in crystallo
and in solution, with all spectra matching well-established CFeSP
absorption features’*’°’***. Importantly, these features disappear when
light is passed through the solution surrounding the crystals, indicating
that spectra represent protein in crystals and not protein that may have
been liberated into the solution. Collectively, these data demonstrate
enzymatic transfer of the CH3-H4folate methyl group to CFeSP-bound
B12, evidence that the B12 domain is able to move at least 18 Å to trigger
methyl transfer within the crystal. To our knowledge, this conformational
movement represents the largest observed in a crystallized
protein (Supplementary Discussion). Such dramatic B12 domain movement
is probably facilitated by the fact that CFeSP/MeTr is mostly
composed of rigid TIM barrels that provide all the lattice contacts
(Supplementary Fig. 8). Although their biosynthesis is energetically
expensive, these high molecular mass TIM barrel scaffolds may be
important for B12-dependent methyltransferases to maintain structural
integrity during the conformational gymnastics that alternately enable
activation, protection and catalysis of the highly reactive B12 cofactor.
Thus, despite the small size of the transferred methyl moiety, these large
conformational changes appear to necessitate large enzyme sizes.
Although the folate-free CFeSP/MeTr structure describes large B12
domain movements that ‘uncap’ B12 from the small subunit, it is
interesting to consider why binding of CFeSP to MeTr does not simply
position the B12 domain directly over the MeTr active site. One
explanation is that the structure represents an inactive complex;
however, in crystallo results clearly demonstrate that CFeSP/MeTr
crystals are active. Another explanation posits that an ensemble of
‘en route’ conformations exists when CH3-H4folate is absent, and that
CH3-H4folate binding would shift the conformational equilibrium,
moving B12 closer to the folate-binding site. To obtain experimental
support for this hypothesis, we solved additional CFeSP/MeTr structures
co-crystallized with CH3-H4folate, with and without Ti(III)
citrate as a reductant at 3.03 Å and 3.50 Å resolution, respectively.
Absorption spectroscopy performed on these crystals shows that these
structures represent a substrate form (CH3-H4folate bound, B12 in the
Co(II) state) and a product form (H4folate bound, B12 in the CH3-Co(III)
state) of the complex (Supplementary Fig. 9). Compared with
Figure 2 | Comparison of B12 positions in ‘resting’ ChCFeSP, folate-free
and folate-bound CFeSP/MeTr. a, ChCFeSP (grey ribbons, Co of B12: black
sphere). b, Folate-free CFeSP/MeTr (green ribbons, Co of B12: green sphere)
superimposed with CH3-H4folate-bound MeTr (Protein Data Bank accession
number 2E7F, pink ribbons). c, Folate-bound CFeSP/MeTr (orange ribbons, Co
of B12: orange sphere). Parts a-c are identical in orientation; B12 sticks coloured
as C, ribbon colour; O, red; N, blue; P, orange. d, Superposition of ChCFeSP
(grey), folate-free CFeSP/MeTr (green), and folate-bound CFeSP/MeTr


the folate-free structure, B12 in both folate-bound structures has
indeed moved even closer to the MeTr folate-binding site (by an
average of 7.7 Å) and exhibits new H-bonding features (Fig. 2c). In
these folate-bound structures, the B12 corrin ring has severed all
interactions with the ‘capping’ CFeSP small subunit and contacts only
MeTr residues. Here, asparagine and glutamine residues that line the
MeTr surface appear to participate in an ‘amide hand off’, sequentially
passing B12 along its trajectory as it progresses towards folate (Fig. 2b, c
and Supplementary Fig. 10).

Interestingly, the terminal amide in this ‘hand off’, Asn 199, is
strictly conserved in both MeTr and MetH and was previously shown 
to switch conformations between folate-free and folate-bound forms”®, 
a feature also observed in the CFeSP/MeTr structures presented here 
(Fig. 2b, c, e). In apo-MeTr structures, Asn 199 points upwards and out
of the active site, whereas in folate-bound MeTr structures Asn 199 
turns down to H-bond with the N’ of folate. Because N199A mutation 
moderately hinders folate binding (20-fold in dissociation constant,
Kd) but dramatically compromises catalytic efficiency (kcat/Km) by
25,000-fold, Asn 199 is thought to be important for formation of the
transition state’®. In our CFeSP/MeTr structures, we observe a new role
for Asn 199 in B12 domain conformational switching: when folate is
absent, Asn 199 points out of the active site, blocking a closer B12
position. However, when folate binds and Asn 199 reorients to
H-bond with folate, space becomes available for B12 to move closer
to the MeTr folate-binding site. Therefore, the position of Asn 199
itself could help shift the conformational equilibrium of the B12
domain, signalling that substrate has bound to MeTr. Asn 199 is an
ideal signal for substrate binding, as it is the only MeTr residue known 
to reposition upon folate binding”. 
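
To put the two N199A effects quoted above on a common scale, a fold change in an equilibrium or rate constant can be converted to an apparent free-energy difference with ΔΔG = RT ln(fold change). The sketch below is only an illustration: the temperature of 298.15 K is an assumption (the assay temperature is not stated in this section), and the function name is hypothetical.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # molar gas constant in kJ/(mol*K)


def fold_change_to_ddg(fold_change: float, temperature_k: float = 298.15) -> float:
    """Apparent free-energy difference (kJ/mol) equivalent to a fold change.

    temperature_k defaults to 298.15 K, which is an assumed value and not
    one taken from the text.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be positive")
    return R_KJ_PER_MOL_K * temperature_k * math.log(fold_change)


# Fold changes quoted in the text for the N199A mutation.
ddg_binding = fold_change_to_ddg(20.0)        # effect on Kd
ddg_catalysis = fold_change_to_ddg(25_000.0)  # effect on kcat/Km

print(f"Kd effect:      {ddg_binding:.1f} kJ/mol")
print(f"kcat/Km effect: {ddg_catalysis:.1f} kJ/mol")
```

Under this assumption the catalytic penalty is more than three times the binding penalty in free-energy terms, which is the quantitative sense in which the mutation affects the transition state more than ground-state binding.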


(orange) structures in a-c, highlighting one helix (thick ribbons) to show
clamping motion (helix axes as straight blue lines) and B12 (sticks) with
12-residue linker (thick ribbons) to B12 domain (surface) to show swinging motion.
e, Superposition of B12 and CH3-H4folate in d, with Asn 199 shown for
CFeSP/MeTr structures in sticks (C, ribbon colour; O, red; N, blue). f, Same as e, with
2Fo − Fc density in blue (1.0σ) and pink mesh (4.0σ), and Fo − Fc density in
green mesh (3.0σ) for folate-bound CFeSP/MeTr structure. Putative alternative
B12 corrin: cyan. g, Superposition of B12 cofactors and CH3-H4folate in f.
Figure 3 | Methyl transfer activity of CFeSP/MeTr crystals by ultraviolet-visible
absorption spectroscopy. As-isolated spectra (blue lines) for CFeSP/MeTr
crystals and CFeSP in solution similarly have broad features at
approximately 400 and 470 nm arising from the Fe4S4 cluster and the B12 corrin.
Following established protocols'*"°”*??””7, B12 reduction was achieved with
Ti(III) citrate, yielding a sharp 390 nm peak indicative of active Co(I) in both
solution and in crystallo spectra (black lines). Further treatment with
CH3-H4folate yields decreased absorbance at 390 nm and a new peak at 450 nm (red
lines), characteristic of the product complex (protein-bound CH3-Co(III)'*!?*”*).
A control reaction (green line) confirms that turnover does not occur from free
CH3-H4folate without MeTr, and the 450 nm peak indicates that B12 remains
CFeSP-bound (free B12 has a peak at approximately 520 nm instead’**!””).
[Figure 4 panel labels: ‘Resting’ CFeSP, Co(I) protected (ChCFeSP structure); B12 ‘en route’ (this work); B12 equilibrium shifted (this work)]

Figure 4 | Cartoon model of B12-dependent methyl transfer in CFeSP/MeTr.
For simplicity, only one of the two CFeSP heterodimers is shown.
Protein domains are coloured as in Fig. 1, loops represent linkers, red hexagon


Displacement of the B12 domain from its resting position to the position
nearest the folate-binding site can be attributed to two independent
conformational changes within the complex, best described as ‘swinging’
and ‘clamping’ motions (Fig. 2d). The B12 domain can ‘swing’ relative to
the rest of CFeSP (Supplementary Fig. 11), and CFeSP can ‘clamp’ the B12
domain towards the MeTr active site (Fig. 2d and Supplementary
Fig. 12). Despite varying degrees of ‘clamping’ over a range of approximately
14° across the structures, the interface between the CFeSP small
subunit and MeTr is preserved (Supplementary Fig. 13).

Although folate binding shifts the average position of the B12
domain closer to the MeTr folate-binding site, the B12 Co is still too
far for SN2 methyl transfer (Fig. 2e). Intriguingly, a large, continuous
electron density peak is present in 2Fo − Fc, Fo − Fc, and composite
omit maps, emanating from the corrin ring and stretching directly over
the folate-binding site, suggestive of an alternative, low-occupancy
corrin conformation (Fig. 2f and Supplementary Fig. 14). A trial
refinement of a putative corrin ring at 40% occupancy satisfies the
Fo − Fc difference maps (Supplementary Fig. 15), positioning B12 over
the folate-binding site.

The multiple positions of B12 captured here (Fig. 2g) highlight the
conformational flexibility of the CFeSP/MeTr scaffold and provide a
framework to understand the molecular juggling of domains during
B12-dependent methyl transfer (Fig. 4). Before MeTr binding, the
CFeSP B12 domain rests against the ‘capping’ small subunit, as in
the structure of ChCFeSP”, with reactive Co(I) of B12 protected (‘resting’
state). From this conformation, either the ‘cap’ or the B12 domain
must move to allow substrate access. Our folate-free CFeSP/MeTr
structure indicates that upon MeTr binding, the B12 domain becomes
‘loosened’ and flexible, adopting an ensemble of conformations that lie
en route towards the MeTr active site. Here, the reactive B12 species
would be protected by the CFeSP small subunit and MeTr TIM barrels.
CH3-H4folate binding to MeTr accompanied by movement of Asn 199
shifts the equilibrium of B12 domain conformers, placing B12 closer to
folate, as in our folate-bound CFeSP/MeTr structures, with B12 protection
afforded by MeTr. It is notable that even after CH3-H4folate
binds, the major B12 position is still not directly over the folate methyl
group, as such a position is expected to be transient. After methyl
transfer, the B12 domain can return to the small subunit to ‘re-cap’
the methylated B12 product, protected by the small subunit TIM barrel.

Overall, our data indicate that B12 domain movement is not a simple,
two-state switch between ‘resting’ and ‘catalytic’ conformations.
Instead, a flexible B12 domain samples an ensemble of conformations,
where subtle shifts of the conformational equilibrium place B12 progressively
closer to the active site, thereby increasing the population of
conformers capable of methyl transfer without obstructing substrate
access or hindering domain movement. This model is consistent with
MetH studies where ligation, alkylation and redox state of the B12
cobalt can favour/disfavour various binding modes, alternately shifting
the equilibrium of conformers for ordered domain rearrangements
during the reaction cycle’*'***”*. We further identify MeTr residues
that contact B12 along its trajectory, ending with Asn 199. In MetH the
B12 ligating residue His 759 has been shown to play a dual role in
catalysis and in signalling conformational shifts’’. The strictly conserved,
folate-binding Asn 199 of MeTr could similarly play a dual role
in both catalysis and conformational signalling. We thus expect this
[Figure 4 panel labels, continued: Folate-free complex; CH3-H4folate-bound complex; ‘Folate-on’ conformation: methyl transfer reaction; Product bound: CH3-Co(III) protected; H4folate-bound complex: B12 equilibrium shifted; (transient); (this work)]

is B12, blue rectangle is folate and transferred methyl group is shown as a yellow
sphere. Curved arrows denote ‘swinging’ and ‘clamping’ motions.


model for dynamic domain juggling, communicated by residues 
involved in substrate and cofactor binding, to be a common theme 
in methyl transfer between the B vitamins folate and B12.


METHODS SUMMARY 


CFeSP and MeTr were expressed and purified anaerobically from M. thermoacetica 
ATCC 39073 and from recombinant Escherichia coli, respectively. Crystals were 
grown anaerobically by hanging drop vapour diffusion. Diffraction data were 
collected at 24ID-C at the Advanced Photon Source, Argonne National Laboratory, 
and 5.0.2 and 8.2.2 at the Advanced Light Source, Lawrence Berkeley National 
Laboratory. Structures were solved by molecular replacement. Data collection and 
refinement statistics are presented in Supplementary Table 1, and representative 
electron densities for protein domains and for cofactors/substrate are shown in 
Supplementary Figs 16-21. Solution and in crystallo absorption spectra were collected 
as described in the text and in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


Protein purification. CFeSP was expressed and purified anaerobically from M. 
thermoacetica ATCC 39073 as described”', except for the following modifications. 
CFeSP was purified from cell extracts using DEAE-cellulose and high-resolution 
Q-Sepharose anion exchange chromatography followed by phenyl-Sepharose 
hydrophobic interaction chromatography. Fractions containing CFeSP were con- 
centrated and buffer exchanged using Amicon ultracentrifuge concentrators in the 
anaerobic chamber. MeTr was expressed and purified anaerobically from recom- 
binant E. coli as described®. Concentrations of CFeSP and MeTr protein samples 
were determined using the Rose-Bengal method” and kept in storage buffer: 
50 mM Tris-HCl, pH 7.6, 100 mM NaCl, 2 mM dithiothreitol.

Crystallization. Crystals of the folate-free CFeSP/MeTr complex were grown by
hanging-drop vapour diffusion in an anaerobic chamber (Coy Laboratories) at
room temperature by adding 1 µl of precipitant (100 mM Bis-Tris, pH 6.5,
100 mM calcium acetate, 9% PEG 5,000 monomethyl ether, 20% glycerol) to
2 µl of an equimolar mixture of CFeSP and MeTr (approximately 250 µM monomer
for each), over a 0.5 ml reservoir solution of precipitant. Large, brown, rod-shaped
crystals appeared overnight. Crystals were looped and cryo-cooled in liquid
nitrogen anaerobically before collection of X-ray diffraction data at 100 K.
Crystals of CFeSP/MeTr co-crystallized with the CH3-H4folate substrate were
obtained in the same manner as above, except the protein solution also contained
CH3-H4folate at 1 mM concentration. Crystals of CFeSP/MeTr co-crystallized
with both CH3-H4folate and Ti(III) citrate as a reductant were obtained in the
same manner, except the precipitant solution also contained Ti(III) citrate at 3 mM
concentration.
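
The drop recipe above implies simple dilution arithmetic: 1 µl of precipitant mixed with 2 µl of protein gives a 3 µl drop, so each precipitant component starts at one third of its stock value and each protein at two thirds. The following is a minimal sketch of that calculation only; it ignores the storage-buffer components and the later concentration of the drop by vapour diffusion, and the helper name `diluted` is hypothetical.

```python
def diluted(stock_value: float, stock_volume_ul: float, total_volume_ul: float) -> float:
    """Concentration of a component right after mixing (same units as the stock)."""
    return stock_value * stock_volume_ul / total_volume_ul


PRECIPITANT_UL = 1.0  # volume of precipitant added to the drop
PROTEIN_UL = 2.0      # volume of the equimolar CFeSP/MeTr mixture
DROP_UL = PRECIPITANT_UL + PROTEIN_UL

# Stock values taken from the crystallization conditions in the text.
precipitant_stock = {
    "Bis-Tris (mM)": 100.0,
    "calcium acetate (mM)": 100.0,
    "PEG 5,000 MME (%)": 9.0,
    "glycerol (%)": 20.0,
}

initial_drop = {
    name: diluted(value, PRECIPITANT_UL, DROP_UL)
    for name, value in precipitant_stock.items()
}
# Each protein is at approximately 250 uM monomer in the protein solution.
initial_drop["CFeSP (uM)"] = diluted(250.0, PROTEIN_UL, DROP_UL)
initial_drop["MeTr (uM)"] = diluted(250.0, PROTEIN_UL, DROP_UL)

for name, value in initial_drop.items():
    print(f"{name}: {value:.1f}")
```

These are starting values only; equilibration against the 0.5 ml reservoir raises the concentrations in the drop over time.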

Structure determination of folate-free CFeSP/MeTr structure. Two X-ray diffraction
data sets were collected for the folate-free CFeSP/MeTr structure. A
lower-resolution data set (3.3 Å) was collected at the Advanced Light Source
(ALS) beam line 5.0.2 (λ = 1.1000 Å), and a higher-resolution data set (2.38 Å)
was later collected at the Advanced Photon Source (APS) beam line 24ID-C
(λ = 0.9792 Å).

The initial data set to 3.3 Å resolution was processed in HKL2000 and
Scalepack”’. The structure was solved by molecular replacement in Phaser”®, using
individual structures of MeTr’® (Protein Data Bank accession number 2E7F) and
ChCFeSP? (Protein Data Bank accession number 2H9A) lacking its B12 domain as
independent search models. Two CFeSP/MeTr complexes (approximately
220 kDa each) were found in the asymmetric unit, and crystals belonged to the
space group P2,2,2, with unit cell dimensions (Å): a = 137.42, b = 159.87 and
c = 241.92. Iterative rounds of refinement with residue-grouped B-factors were
performed in CNS*! and PHENIX”, with model building in Coot**. The four B12
domains present in the asymmetric unit were kept as a polyalanine model. Final
R-factors for working and test reflections (Rwork and Rfree) were 29.2% and 33.7%,
respectively, when refinement of the structure to higher resolution began. Data
collection and refinement statistics for this data set are shown in Supplementary
Table 1. Ramachandran analysis was performed in PROCHECK™: 74.0% of residues
resided in the most favoured region, with 21.2% additionally allowed, 3.1%
generously allowed and 1.7% disfavoured.

The data set to 2.38 Å resolution was processed in HKL2000 and Scalepack”.
Although this crystal formed in similar conditions as the crystal which gave the
3.3 Å resolution data set, the space group was now P2,2,2, with unit cell dimensions
(Å): a = 125.71, b = 242.84 and c = 79.67. The structure for this crystal was
thus solved by molecular replacement in Phaser”® using the MeTr homodimer and
CFeSP heterodimers lacking B12 domains from the previously refined model of the
3.3 Å resolution structure as independent search models. Only one CFeSP/MeTr
complex was present in the asymmetric unit. Iterative rounds of refinement were
performed in CNS* and PHENIX”, with model building in Coot**.
Translation/libration/screw refinement was performed in later refinement rounds with seven
translation/libration/screw groups: the MeTr homodimer (chains A and B), the
Fe4S4 domain of one CFeSP large subunit (chain C), the TIM domain of chain C
with one small subunit (chain D), the B12 domain of chain C, the Fe4S4 domain of
the second large subunit (chain E), the TIM domain of chain E with the second
small subunit (chain F), and the B12 domain of chain E. Data collection and
refinement statistics are shown in Supplementary Table 1, and average B-factors
for each domain of the final model are given in Supplementary Table 2.
Ramachandran analysis was performed in PROCHECK™: 90.1% of residues reside
in the most favoured region, with 9.5% additionally allowed, 0.3% generously
allowed and 0.2% disfavoured. The final model contains residues 1-262 (of 262)
for both MeTr chains (A and B), residues 2-442 (of 446) for both CFeSP large
subunit chains (C and E) and residues 1-323 (of 323) for both CFeSP small subunit
chains (D and F).

Except for the Fe4S4 and B12 domains, the entire structure is composed of TIM
barrels for which the electron density is well-defined (Supplementary Fig. 16).
Electron density is weaker for the Fe4S4 domains (Supplementary Figs 5, 17 and
19), consistent with the fact that these domains exhibit higher B-factors
(Supplementary Table 2 and Supplementary Fig. 7). However, reasonable electron
density is present for the main chain and most side chains, allowing us to build a
model for this domain. Still, several side chains of the Fe4S4 domains lack clear
electron density; thus, for these residues, atoms were truncated past the Cβ atom
(chain C, 12 residues truncated and chain E, 16 residues truncated, out of 56 total
residues).

Although B-factors are high and electron density is weak for the B12 domains in
general, electron density for the B12 cofactors is unambiguous (Supplementary Fig.
20), and density is also clear in several helical regions, including those near B12
(Supplementary Fig. 18). Because the structure of a CFeSP B12 domain bound with
B12 was already known’, we used the clear electron density of the B12 cofactor and
the resolvable helices to position the B12 domain during model building. Still,
many side chains of the CFeSP B12 domains lacked clear electron density, and
thus for these residues, atoms were truncated past the Cβ atom (chain C, 50
residues truncated and chain E, 59 residues truncated, out of 118 total residues).

The B12 cofactor in the final model contains 5,6-dimethylbenzimidazole as the
lower ligand moiety, as in cobalamin. Although active with cobalamin*’, previous
studies have shown that CFeSP isolated from M. thermoacetica harbours an
unusual B12 derivative that contains 5’-methoxybenzimidazole as the lower ligand
instead**. However, disorder of the B12 cofactor and B12 domain owing to thermal
motion of these regions in the CFeSP/MeTr crystal resulted in weak electron
density for substituents of the benzimidazolyl ring (Supplementary Figs 18 and
20). Therefore, we cannot confirm the presence of this unusual B12 derivative from
our crystallographic studies, and we have thus modelled cobalamin as the form of
B12 in the structure.

Previous spectroscopic studies” in addition to the crystal structure of ChCFeSP”
have indicated that a water molecule coordinates the central cobalt of B12 in the
as-isolated CFeSP. Here, Co(II) is the major species and is expected to be
five-coordinate. However, because of disorder we do not observe electron density to
suggest a water molecule bound to cobalt (Supplementary Figs 18 and 20).
Accordingly, we have not modelled a water molecule.

Structure determination of folate-bound CFeSP/MeTr structures. For crystals
grown with CH3-H4folate, X-ray diffraction data were collected at APS beam line
24ID-C to 3.50 Å resolution at λ = 1.6039 Å to optimize the cobalt peak anomalous
signal. For crystals grown with both CH3-H4folate and Ti(III) citrate, X-ray
diffraction data were collected at ALS beam line 8.2.2 to 3.03 Å resolution at
λ = 1.0000 Å. The structures were solved by molecular replacement using the
MeTr homodimer and CFeSP heterodimers lacking Fe4S4 and B12 domains from
the folate-free 2.38 Å CFeSP/MeTr structure as independent search models.
Refinement of the folate-free CFeSP/MeTr structure against either folate-bound
X-ray data set was not sufficient to solve the structure, as the unit cell dimensions
were markedly different (Supplementary Table 1). After molecular replacement,
one CFeSP/MeTr complex was present in the asymmetric unit, and omit electron
density clearly indicated the presence of bound folate (Supplementary Fig. 21).
Iterative rounds of refinement were performed in CNS*’ and PHENIX”, with
model building in Coot**. The same test set of reflections for Rfree calculations
was used for both folate-bound data sets. Data collection and refinement statistics
are shown in Supplementary Table 1. Ramachandran analysis was performed in
PROCHECK**: for the CH3-H4folate-only structure, 89.6% of residues reside in
the most favoured region, with 9.8% additionally allowed, 0.3% generously allowed
and 0.2% disfavoured. For the CH3-H4folate with Ti(III) citrate structure, 89.5% of
residues reside in the most favoured region, with 10.0% additionally allowed, 0.2%
generously allowed and 0.3% disfavoured. The final models both contain folate, B12
and residues 1-262 (of 262) for MeTr chains (A and B), residues 2-442 (of 446) for
CFeSP large subunit chains (C and E) and residues 1-323 (of 323) for CFeSP small
subunit chains (D and F). As with the folate-free structure, several side chains of the
Fe4S4 domains for both folate-bound structures lacked clear electron density; thus
for these residues, atoms were truncated past the Cβ atom (chain C, 15 residues
truncated and chain E, 18 residues truncated, out of 52 total residues). Similarly,
many side chains of the B12 domains lacked electron density, and thus for these
residues, atoms were truncated past the Cβ atom (chain C, 51 residues truncated
and chain E, 78 residues truncated, out of 118 total residues). The liganded/oxidation
states of folate and B12 in these structures were determined by use of a
microspectrophotometer (see below).
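
The choice of λ = 1.6039 Å for the cobalt anomalous signal can be checked with the standard conversion E (keV) ≈ 12.3984 / λ (Å). The sketch below applies it to the four wavelengths quoted in these Methods. The cobalt K-edge value of about 7.709 keV is a tabulated figure supplied here for comparison and does not come from the text, and the dictionary labels are illustrative.

```python
HC_KEV_ANGSTROM = 12.398419843  # Planck constant times speed of light, in keV*Angstrom
COBALT_K_EDGE_KEV = 7.709       # tabulated Co K absorption edge (not from the text)


def photon_energy_kev(wavelength_angstrom: float) -> float:
    """Photon energy in keV for an X-ray wavelength given in Angstrom."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom


# Wavelengths quoted in the Methods for each data set.
data_sets = {
    "folate-free, 3.3 A (ALS 5.0.2)": 1.1000,
    "folate-free, 2.38 A (APS 24ID-C)": 0.9792,
    "CH3-H4folate only, 3.50 A (APS 24ID-C)": 1.6039,
    "CH3-H4folate + Ti(III) citrate, 3.03 A (ALS 8.2.2)": 1.0000,
}

energies = {name: photon_energy_kev(lam) for name, lam in data_sets.items()}
for name, energy in energies.items():
    offset_ev = (energy - COBALT_K_EDGE_KEV) * 1000.0
    print(f"{name}: {energy:.3f} keV ({offset_ev:+.0f} eV relative to Co K-edge)")
```

Only the 1.6039 Å data set lies within a few tens of eV of the cobalt edge; the other three were collected several keV above it.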

Solution and in crystallo ultraviolet-visible absorption spectroscopy to determine
enzyme activity in crystallo. Titanium(III) citrate (100 mM in 50 mM Tris,
pH 7.6) was prepared”, and (6S)-5-methyl-5,6,7,8-tetrahydrofolate (CH3-H4folate)
containing one glutamate tail was purchased from Schircks Laboratories.
As-isolated, reduced and methylated CFeSP samples in solution were prepared in a
room-temperature anaerobic chamber (MBraun) following similar procedures to
those previously described'*"’***”°’. Briefly, purified CFeSP (20 µM) was used
for the as-isolated sample, CFeSP mixed with Ti(III) citrate (1 mM) was used for
the reduced sample, and CFeSP mixed with equimolar MeTr, Ti(III) citrate
(1 mM), and CH3-H4folate (1 mM) was used for the methylated sample. Spectra
were taken using a Nanodrop 2000c (Thermo Scientific) in a quartz cuvette or on
the sample stage in the anaerobic chamber directly after mixing; identical solutions
lacking CFeSP were used as blanks.

To obtain in crystallo absorption spectra, CFeSP/MeTr crystals in as-isolated,
reduced and methylated forms were prepared in a similar fashion. In a
room-temperature anaerobic chamber (Coy Laboratories), crystals were looped into a
2 µl drop, which was placed on a cover slide and contained one of the following
three solutions for as-isolated, reduced and methylated samples, respectively: well
solution, well solution with Ti(III) citrate (10 mM) and well solution with Ti(III)
citrate (10 mM) and CH3-H4folate (1 mM). A ring of epoxy surrounding each
drop was applied to the cover slide, and a second cover slide was placed on top,
sandwiching the drops within a uniform distance separation and sealing the
crystals within an anaerobic environment. Upon curing of the epoxy, crystals were
brought out of the anaerobic chamber and mounted on an XZ translation stage
(Newport, UMR8.25 and SM-13) in a fibre-optic-coupled microspectrophotometer
(Ocean Optics, Jaz) with 40 mm diameter reflective objectives (Optique Peter) and
a deuterium-halogen lamp (DH2000-BAL, Ocean Optics), similar to that previously
described**””. Stray light was blocked with blackout material. The light focus
was coarsely aligned to the crystals by visual inspection and then finely aligned by
monitoring light transmission in real time. Data were acquired at room temperature
with the SpectraSuite software (Ocean Optics). The background transmission
was measured through the solution immediately surrounding the crystals.
The dark current was measured with the light shuttered off. Sample, reference and
dark current spectra were acquired by averaging 10-50 scans with total exposure
times of 90-1000 ms. Experiments were completed within 60 min of sample
preparation, and crystals remained intact over the course of the experiment, as
observed using a microscope after data collection.
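
The sample, reference and dark-current spectra described above are conventionally combined into absorbance as A = −log10[(I_sample − I_dark) / (I_reference − I_dark)] at each wavelength. The sketch below shows that standard calculation together with scan averaging. It is not the SpectraSuite implementation, and the function names and the example counts are illustrative.

```python
import math


def average_scans(scans):
    """Average several scans (lists of detector counts) wavelength by wavelength."""
    if not scans:
        raise ValueError("at least one scan is required")
    return [sum(column) / len(scans) for column in zip(*scans)]


def absorbance(sample, reference, dark):
    """Per-wavelength absorbance from sample, reference and dark-current counts."""
    values = []
    for s, r, d in zip(sample, reference, dark):
        transmitted = s - d  # light through the crystal, dark-corrected
        incident = r - d     # light through surrounding solution, dark-corrected
        if transmitted <= 0 or incident <= 0:
            values.append(float("nan"))  # no usable signal at this wavelength
        else:
            values.append(-math.log10(transmitted / incident))
    return values


# Made-up counts for two wavelengths, averaged over two scans each.
sample = average_scans([[108.0, 1012.0], [112.0, 1008.0]])
reference = average_scans([[1010.0, 1010.0], [1010.0, 1010.0]])
dark = average_scans([[10.0, 10.0], [10.0, 10.0]])

spectrum = absorbance(sample, reference, dark)
print(spectrum)
```

In this example the first wavelength transmits one tenth of the incident light (absorbance 1.0) and the second transmits all of it (absorbance 0).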

To generate Fig. 3, absorbance spectra were scaled relative to each other to 
account for variable crystal sizes and path lengths, where absolute peak absor- 
bances did not exceed one absorbance unit. 

In crystallo ultraviolet-visible absorption spectroscopy on folate-bound
CFeSP/MeTr crystals to determine liganded/oxidation state of bound B12
and folate. Ultraviolet-visible absorption spectra were collected on a
microspectrophotometer at 100 K for crystals of folate-free CFeSP/MeTr, crystals that
were grown in the presence of CH3-H4folate only and for crystals grown in the
presence of both CH3-H4folate and Ti(III) citrate (Supplementary Fig. 9). The
spectra were compared with the analogous solution spectra (Fig. 3). The spectrum
for the folate-free crystal was similar to the spectrum of CFeSP alone in solution,
with broad features at approximately 400 and 470 nm indicative of the Fe4S4
cluster and B12 primarily in the Co(II) state. The spectrum for the crystal grown
with CH3-H4folate matched the spectrum of folate-free crystals, indicating that
B12 had remained primarily in the Co(II) state, and turnover had not occurred.
However, the spectrum for the crystal grown with both CH3-H4folate and Ti(III)
citrate was markedly different and contained a peak at 450 nm, indicating that B12
in these crystals was methylated to the CH3-Co(III) state. Based on these data, we
modelled the methyl group on folate in the structure co-crystallized with
CH3-H4folate only, whereas we modelled the methyl group bound to Co of B12 for the
CH3-H4folate/Ti(III) citrate structure. Without these spectroscopic data, assignment
of the location of the methyl group would otherwise have been prevented by
the resolution limits of the data.
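
The assignment logic used above, matching an observed sharp peak to the cobalt state it indicates, can be summarized as a small lookup. This is a sketch only: the reference wavelengths are the approximate values quoted in the text and in the Fig. 3 legend, the 10 nm tolerance is an arbitrary assumption, and the broad as-isolated features near 400 and 470 nm are deliberately left out because they are not sharp peaks.

```python
# Approximate peak positions (nm) and the states they indicate, as quoted in the text.
REFERENCE_PEAKS_NM = {
    390.0: "Co(I) (reduced, active B12)",
    450.0: "protein-bound CH3-Co(III) (methylated product)",
    520.0: "free B12 (no longer CFeSP-bound)",
}


def assign_peak(peak_nm: float, tolerance_nm: float = 10.0):
    """Return the state whose reference peak is nearest, or None if none is close.

    tolerance_nm is an assumed matching window, not a value from the text.
    """
    nearest = min(REFERENCE_PEAKS_NM, key=lambda ref: abs(ref - peak_nm))
    if abs(nearest - peak_nm) <= tolerance_nm:
        return REFERENCE_PEAKS_NM[nearest]
    return None


for observed in (391.0, 452.0, 518.0, 600.0):
    print(observed, "->", assign_peak(observed))
```

A real analysis would compare whole spectra rather than single peak positions, as the authors do when matching crystal spectra to solution spectra.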
Structure and mechanism of a glutamate-GABA antiporter
Food-borne hemorrhagic Escherichia coli, exemplified by the
strains O157:H7 and O104:H4 (refs 1, 2), require elaborate acid-resistance
systems (ARs)’ to survive extremely acidic environments
such as the stomach (pH ~ 2). AR2 expels intracellular
protons through the decarboxylation of L-glutamate (Glu) in the
cytoplasm and exchange of the reaction product γ-aminobutyric
acid (GABA) with extracellular Glu. The latter process is mediated
by the Glu-GABA antiporter GadC*”, a representative member of
the amino-acid-polyamine-organocation superfamily of membrane
transporters. The functional mechanism of GadC remains largely
unknown. Here we show, with the use of an in vitro proteoliposome-based
assay, that GadC transports GABA/Glu only under acidic conditions,
with no detectable activity at pH values higher than 6.5. We
determined the crystal structure of E. coli GadC at 3.1 Å resolution
under basic conditions. GadC, comprising 12 transmembrane
segments (TMs), exists in a closed state, with its carboxy-terminal
domain serving as a plug to block an otherwise inward-open conformation.
Structural and biochemical analyses reveal the essential
transport residues, identify the transport path and suggest a conserved
transport mechanism involving the rigid-body rotation of a
helical bundle for GadC and other amino acid antiporters.

Other homologous amino-acid-polyamine-organocation (APC)
family members include the key AR3 component arginine-agmatine
(Arg-Agm) antiporter AdiC*’ (Fig. 1a), the lysine-cadaverine antiporter
CadB, and the putrescine-ornithine antiporter PotE. Structural
analysis of AdiC*'' revealed a conserved LeuT fold that is associated
with the Na+-coupled symporters'*! and a proton-coupled transporter
ApcT"®. Ligand-free AdiC exists in an outward-open conformation*”®,
and binding of Arg triggers a major structural rearrangement resulting
in an occluded conformation’. Despite these advances, the transport
mechanism for AdiC or any other amino acid antiporter remains
poorly understood. The inward-open conformation has yet to be
captured for any amino acid antiporter.

The full-length, wild-type (WT) GadC (residues 1-511), derived
from the E. coli strain O157:H7, was purified to homogeneity. To characterize
GadC, we reconstituted a proteoliposome-based transport assay
(Fig. 1b), in which substrate transport was monitored through the
detection of 3H-labelled Glu. Incorporation of GadC into the liposome
allowed the rapid accumulation of Glu at pH 4.5 (Fig. 1b; Supplementary
Fig. 1). Strikingly, substrate transport by GadC decreased sharply
with increasing pH values; the accumulation of Glu after 15 s at pH 5.0,
5.5 and 6.0 was about 67%, 32% and 8% of that at pH 4.5 (Supplementary
Fig. 1). At pH 6.5 and above, there was no detectable accumulation
of Glu (Fig. 1b). Thus, substrate transport by GadC is strictly
pH-dependent, with robust activity at pH 5.5 or below. This biochemical
property not only helps enterobacteria to survive in acidic
environments, but may also be important for avoiding unnecessary
proton efflux under neutral pH growth conditions.
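
As a rough illustration of how steep this pH dependence is, the relative accumulation values quoted above can be interpolated to estimate the pH at which activity falls to half of its pH 4.5 level. The sketch below uses straight-line interpolation between neighbouring points, which is an assumption made for simplicity and not a fit reported in the paper; treating "no detectable accumulation" at pH 6.5 as 0% is likewise an assumption.

```python
# (pH, Glu accumulation after 15 s as % of the pH 4.5 value), from the text.
# The 0% entry at pH 6.5 stands in for "no detectable accumulation".
RELATIVE_ACTIVITY = [(4.5, 100.0), (5.0, 67.0), (5.5, 32.0), (6.0, 8.0), (6.5, 0.0)]


def ph_at_activity(points, target_percent: float) -> float:
    """Linearly interpolate the pH at which activity equals target_percent."""
    for (ph_lo, act_lo), (ph_hi, act_hi) in zip(points, points[1:]):
        if act_lo >= target_percent >= act_hi:
            if act_lo == act_hi:
                return ph_lo
            fraction = (act_lo - target_percent) / (act_lo - act_hi)
            return ph_lo + fraction * (ph_hi - ph_lo)
    raise ValueError("target_percent is outside the measured range")


half_activity_ph = ph_at_activity(RELATIVE_ACTIVITY, 50.0)
print(f"Estimated pH of half-maximal accumulation: {half_activity_ph:.2f}")
```

With these numbers the estimate falls between pH 5.0 and 5.5, so most of the loss of activity occurs within roughly one pH unit.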

AdiC has been thoroughly characterized, with elevated transport
activities at pH 4 and 6 (ref. 17); however, AdiC still allows moderate
transport even under basic conditions such as pH 8 (ref. 17). We
compared the activities of AdiC and GadC under identical sets of
experimental conditions (Fig. 1c, d). The results confirmed and
extended published findings. For AdiC, the total substrate accumulation in the
proteoliposomes over 10 min at pH 9.0 was about 21% of that at pH 5.0
(Fig. 1c). In sharp contrast, GadC has no detectable activity at pH 6.5
or higher. Determination of the maximal transport activity (Vmax) and
Km for AdiC and GadC at different pH values confirmed these observations
(Fig. 1d and Supplementary Fig. 2). Thus, in comparison with
AdiC, GadC exhibits much more stringent dependence on pH for
substrate transport.

Next we replaced GABA individually with 19 natural amino acids;
the substrate transport for each amino acid was measured at pH 5.5
(Fig. 1e). In addition to Glu, GadC also efficiently transports three
additional amino acids: Gln and, to a smaller extent, Met and Leu.
This result was unexpected because, unlike Glu, none of the three
amino acids carries a charge or can be protonated on the side chain.
Notably, the Vmax for Gln is considerably larger than that for Glu or
GABA at pH 5.5, whereas the Km for Gln is comparable to that for
GABA (Supplementary Fig. 3). In particular, GadC allows little, if any,
transport of the amino acids Asp, Phe, Gly, His, Lys, Asn, Pro and Trp
(Fig. 1e). Substrate transport for Asp and Asn was less than 5% of that
for the chemically similar amino acids Glu and Gln, respectively. This
analysis strongly indicates that GadC is highly selective in substrate
transport.

Efflux of protons presumably contributes to a build-up of a nega- 
tive potential within the lipid membrane, which is known to be 
unfavourable for continued substrate transport by amino acid 
antiporters such as AdiC’” and AspT'*®. We examined Glu-GABA 
exchange in the proteoliposome-based assay in the absence or presence 
of valinomycin (Supplementary Fig. 4). The results clearly demonstrate 
that positive potential within the proteoliposomes stimulated substrate 
transport, whereas negative potential led to decreased transport. 

We crystallized the full-length GadC in the space group P2;22, at
pH 8.0 and determined the structure by platinum-based single-wavelength
anomalous dispersion (SAD) at 3.1 Å resolution (Supplementary
Tables 1 and 2; Supplementary Figs 5 and 6). Each asymmetric
unit contains two molecules of GadC, arranged in an antiparallel
fashion (Supplementary Fig. 5). This packing pattern suggests that,
similarly to AdiC'® and the transporter ClC”’, the functional entity
for GadC is likely to be a single molecule. GadC contains 12 TMs,
with TM1 and TM6 each containing two short α-helices connected
by a discontinuous stretch in the middle (Fig. 2a and Supplementary
Fig. 7). The structure of GadC, together with the identification of two
periplasmic loops in GadC’’, allows an unambiguous assignment of its
membrane topology. In a similar manner to AdiC and other LeuT-type
transporters, GadC contains two inverted repeats, TM1-TM5 and
TM6-TM10, which are related to each other by a pseudo-two-fold axis
(Supplementary Fig. 8a). The periplasmic and cytoplasmic sides of
GadC are highly charged (Supplementary Fig. 8b).
Figure 1 | Functional characterization of GadC. a, Schematic diagram of 
AR2 and AR3 in E. coli. The antiporter GadC of AR2 exchanges extracellular 
Glu for intracellular GABA, resulting in the net efflux of one proton per cycle. 
Glu is decarboxylated by GadA/B to become GABA in cells. The Arg-Agm 
antiporter AdiC and the decarboxylase AdiA are the equivalents of GadC and 
GadA/B, respectively. b, GadC shows pH-dependent transport in the 
proteoliposome assay. ³H-labelled Glu was present at roughly 0.19 µM to 
permit the measurement of transport. The transport activity is robust at pH 4.5 
and rapidly decreases with increasing pH, with no detectable transport activity 


In comparison with AdiC and other homologues, GadC contains a 
unique, extended C-terminal fragment (Supplementary Fig. 9). In 
contrast with reported structures of AdiC*"', GadC seems to adopt 
an inward-open conformation (Fig. 2a). The open path leads to a 
negatively charged environment (Fig. 2b), where substrate-binding 
residues are likely to be located. However, the C-terminal fragment 
(residues 477-511) forms a folded domain and completely blocks the 
path to the putative substrate-binding site. The C-terminal fragment, 
with clear electron density (Supplementary Fig. 10a), is hereafter 
referred to as the C-plug. The observation of a blocked transport path 
in GadC is consistent with the fact that the crystals were generated at 
pH 8.0, at which no transport activity could be detected (Fig. 1c). 

The location of the C-plug strongly suggests that its displacement is 
a prerequisite for the transport activity of GadC. To examine this, we 
generated a C-plug-deleted GadC variant (residues 1-470) and 
measured its ability to permeate Glu-GABA. Strikingly, whereas WT 
GadC showed little substrate transport over 60 min at pH 6.5, GadC (1-470) 
showed a significant level of Glu accumulation (Fig. 3a). 
at pH 6.5 or above. c, Comparison of pH-dependent substrate transport of 
AdiC and GadC. GadC exhibits stringent pH dependence, with no detectable 
transport activity at pH values higher than 7.0. By contrast, AdiC has 
considerable transport activity at pH 7.0, 8.0 and 9.0. d, Comparison of Vmax for 
AdiC and GadC at different pH values. e, The substrate specificity of GadC- 
mediated transport in the proteoliposome assay. Substrate transport was 
measured at pH 5.5 for 10 min. GadC only allows transport of GABA, Glu, Gln 
and (to a smaller extent) Met and Leu. All error bars represent the s.d. for three 
independent experiments. 


Additional measurements show that the transport activity of 
GadC (1-470) was rapidly decreased with increasing pH and became 
undetectable at pH 7.5 or above (Fig. 3b). Overall, deletion of the 
C-plug in GadC shifted its pH-dependent substrate transport towards 
a higher pH. 

The C-plug contains several basic amino acids and makes multiple 
intra-domain and inter-domain hydrogen bonds (Fig. 3c and Sup- 
plementary Fig. 10b). The tightly folded conformation of the C-plug 
is stabilized by two centrally located basic residues, His 491 and 
Arg 499. His 491 donates a hydrogen bond to Ser 500, whereas 
Arg 499 makes three hydrogen bonds to the main-chain atoms of 
Phe 492 and Leu 494. At one end of the plug, Tyr 503 forms a hydrogen 
bond to Ala 487. These intra-domain interactions are complemented 
by inter-domain contacts. At the other end of the plug, the guanidinium 
group of Arg 497 donates a hydrogen bond to Gln 98 on TM3. In 
addition, His 502 makes a hydrogen bond to Arg 314 on TM8, whereas 
the main chain atoms of Val 477 and Ser 484 form hydrogen bonds with 
Gln 321 on TM8 and Glu 226 on TM6. 


Figure 2 | Overall structure of GadC. a, Overall structure of the WT full- 

length GadC. TM1-TM10 are rainbow-coloured, with TM1 in blue and TM10 
in red. TM11 and TM12 are shown in grey. The C-terminal fragment (C-plug) 
is coloured magenta. b, The C-plug of GadC blocks an otherwise inward-open 


Perturbation of these interactions is predicted to compromise the 
stability of the C-plug and consequently to facilitate its displacement. 
To examine this prediction, we generated five GadC variants—H491A, 
conformation. Two perpendicular views of GadC are shown. The C-plug blocks 
access to the negatively charged substrate-binding cleft (inset). All structural 
figures were prepared with PyMol”. 


R497A, R499A, H502A and Y503A—each carrying a missense muta- 
tion on a critical residue in the C-plug. We individually investigated 
substrate transport by these GadC variants at pH 6.5. Whereas the WT 
GadC exhibited little activity, all five GadC variants allowed varying 
but significant levels of substrate transport (Fig. 3d). In particular, the 
transport activities for GadC-H491A and GadC-R497A were similar 
to that of GadC (1-470) (Fig. 3d). By contrast, two additional GadC 
variants, H495A and H511A, allowed considerably less Glu accumula- 
tion, consistent with the observation that these mutations do not affect 
any critical interactions in the C-plug. 

The closed conformation of GadC is attained not only by the C-plug 
in the cytoplasm but also by the L7 loop at the periplasmic side 
(Supplementary Fig. 11a). The L7 loop interacts with surrounding 
structural elements through a combination of hydrogen bonds and 
van der Waals contacts (Supplementary Fig. 11b, c). We speculate that, 


Figure 3 | The C-plug regulates substrate transport. a, Truncation of the 
C-plug (residues 471-511) allowed transport of Glu at pH 6.5. Shown here is 
the total accumulation of Glu in the proteoliposomes in 60 min. b, Deletion of 
the C-plug shifts the pH-dependent transport activity of GadC towards higher 
pH values. c, The C-plug interacts with surrounding structural elements 
through multiple hydrogen bonds. The C-plug is coloured magenta and 
surrounding structural elements are shown in grey. d, Structural integrity of the 
C-plug is important for the transport activity of the WT GadC. Although the 
WT GadC does not allow apparent transport of substrates at pH 6.5, several 
missense mutants acquired this ability. These mutations probably led to 
compromised interactions between the C-plug and its surrounding structural 
elements. All error bars represent the s.d. for three independent experiments. 
during each cycle of transport, the L7 loop must be displaced, at least 
transiently, to allow the passage of substrate molecules. The Cα-Cα 
distance between residue 267 on L7 and residue 364 on TM10 is about 
5.6 Å (Supplementary Fig. 11d), which is ideal for disulphide bond 
formation if these two residues are replaced by Cys. We generated a 
double mutation GadC-L267C/N364C and subjected the purified 
protein to oxidation by o-phenanthroline copper complex~’. The 
oxidized GadC variant (L267C/N364C) showed undetectable sub- 
strate transport in the absence of the reducing agent dithiothreitol 
but restored substrate transport in the presence of dithiothreitol (Sup- 
plementary Fig. 11d). By contrast, the oxidized WT GadC showed 
similar levels of substrate transport in the absence or presence of 
dithiothreitol. 

The available structures of AdiC*" greatly facilitated the identification 
of gating residues in GadC. The substrate transport path is sandwiched 
axially between the C-plug and the L7 loop and surrounded laterally by 
TM1, TM3, TM6, TM8 and TM10 (Fig. 4a). Sequence alignment with 
AdiC and other amino acid antiporters identified six potential gating 
residues in GadC: Tyr 96, Tyr 214, Glu 218, Trp 308, Tyr 378 and 
Tyr 382 (Supplementary Fig. 9). These residues are located within or 
in close proximity to the putative transport path (Fig. 4a). Comparison 
of the spatial locations for these amino acids revealed a pattern that 
differed from that in AdiC of the outward-open conformation’ or Arg- 
bound occluded conformation’ (Supplementary Fig. 12). 

The distal gate in AdiC comprises three amino acids, with Glu 208 
hydrogen bonded to Tyr 93 and Tyr 365 (ref. 9). Glu 208 and Tyr 365 
in AdiC correspond to Glu 218 and Tyr 382 in GadC, respectively 
(Supplementary Fig. 9). Tyr 378 is also located within hydrogen-bonding 
distance of Glu 218. Tyr 96 in GadC, which may correspond to Tyr 93 
in AdiC, is located roughly 5 Å away from the position of Tyr 93 
(Supplementary Fig. 12). The middle gate residue Trp 293 in AdiC 
corresponds to Trp 308 in GadC, which is farther from the axial centre 
of transport, probably reflecting the nature of the inward-open con- 
formation. The proximal gate residue, Trp 202, in AdiC closes on bind- 
ing to Arg and occludes the substrate molecule from the periplasm”; the 
corresponding residue remains to be conclusively identified in GadC. 
The nearby residues Phe 210 and Tyr 214 are located in different posi- 
tions from that of Trp 202 in AdiC. This analysis suggests major con- 
formational changes for these putative gating residues during the 
substrate transport cycle. Several other amino acids are located within 
the putative transport path and may have a function in transport. In 
particular, Tyr 30 on TM1 donates a hydrogen bond to Glu 304 on TM8 
(Supplementary Fig. 13a). 

To corroborate the structural analysis, we generated six GadC 
variants—Y30A, E218A, E304A, W308A, Y378A and Y382A—each 
with a missense mutation targeting a putative gating residue. In con- 
trast with WT GadC, each mutation caused at least 90% decrease in 
substrate transport (Supplementary Fig. 13b). GadC-W308A, which 
affects the putative middle gate residue Trp 308, only retained about 
2% of the WT activity. By contrast, the GadC variant M25A, which 
does not affect any putative transport residue, showed about 74% of 
the WT transport activity. 

Under the assumption of a conserved transport mechanism 
between GadC and AdiC, the inward-open conformation of GadC, 
with omission of the C-plug, probably reflects a distinct state of AdiC 
during substrate transport. Structural alignment between outward- 
open AdiC® and GadC reveals a striking pattern of conformational 
changes that are concentrated in a helical bundle comprising TM1, 
TM2, TM6 and TM7 (Fig. 4b). For simplicity of discussion, these four 
TMs are collectively named the gate domain, and the rest is referred to 


Figure 4 | The transport path in GadC. 

a, Identification of key amino acids in the transport 
path. Sequence alignment and structural analysis 
identify six amino acids that may be essential in 
substrate transport. These residues are shown in a 
close-up view in the right panel. b, The gate domain 
in GadC, comprising TM1, TM2, TM6 and TM7, 
undergoes the most pronounced conformational 
changes compared with those of the outward-open 
AdiC*. The core domain showed only minor 
changes. The gate and core domains of GadC are 
coloured green and grey, respectively; the gate and 
core domains of AdiC are coloured blue and 
yellow, respectively. c, The conformational changes 
of the gate domain amount to a rigid-body rotation 
of about 35°. d, Structural overlay of the gate 
domains from GadC (green) and AdiC (blue). 
as the core domain. In contrast with that in AdiC, the gate domain in 
GadC seems to rotate clockwise by about 35° (Fig. 4c), resulting in an 
outward-closed and inward-open conformation in GadC. Structural 
alignment between the isolated gate domains of AdiC and GadC 
revealed only minor changes in these four TMs (Fig. 4d), suggesting 
a rigid-body movement between the gate and the core domains. This 
structural analysis supports the notion of alternating access for mem- 
brane transporters””*. The four TMs in the gate domain undergo the 
most drastic structural rearrangement among all 12 TMs of GadC 
(Supplementary Fig. 14). In comparison with the gate domain, TMs 
of the core domain have a considerably smaller degree of conforma- 
tional changes, particularly TM3, TM8 and TM9. 
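
As an illustration of how a rigid-body rotation of this kind can be quantified, the angle can be read from the trace of the rotation matrix that superimposes one copy of a helical bundle on the other. The sketch below is a minimal Python example under stated assumptions: the function names are hypothetical, and the matrix is a synthetic 35° rotation about the z axis, not one derived from the GadC and AdiC coordinates.

```python
import numpy as np

def rotation_angle_deg(R):
    """Return the rotation angle (degrees) encoded by a 3x3 rotation matrix.

    For a proper rotation, trace(R) = 1 + 2*cos(theta).
    """
    cos_theta = (np.trace(R) - 1.0) / 2.0
    # Clip guards against values slightly outside [-1, 1] from rounding.
    return float(np.degrees(np.arccos(np.clip(cos_theta, -1.0, 1.0))))

def rotation_about_z(angle_deg):
    """Build a synthetic rotation about the z axis (illustration only)."""
    t = np.radians(angle_deg)
    return np.array([[np.cos(t), -np.sin(t), 0.0],
                     [np.sin(t),  np.cos(t), 0.0],
                     [0.0,        0.0,       1.0]])

R = rotation_about_z(35.0)            # hypothetical stand-in for a fitted matrix
print(round(rotation_angle_deg(R), 1))
```

In a real analysis the matrix would come from a least-squares superposition of the gate-domain Cα atoms after the core domains have been aligned.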

The mechanistic conservation may go beyond the amino acid 
antiporters. Mhp1 was recently found to switch from outward-open 
to inward-open conformation through a rigid-body rotation involving 
two helical bundles”*. After the exit of Na⁺, the Na⁺-galactose transporter 
vSGLT was also thought to undergo a minor rigid-body movement 
involving two helical bundles”. Of the two helical bundles, one 
contains TM3, TM4, TM8 and TM9, and the other comprises TM2, 
TM6 and TM7; this is true for Mhp1 (ref. 24), vSGLT” and AdiC/GadC, 
for all of which at least two distinct conformations of a transport 
cycle have been structurally characterized. The helical bundle that 
executes the rigid-body movement is proposed to be the gate domain 
(TM1, TM2, TM6 and TM7) in GadC and the hash motif (TM3, TM4, 
TM8 and TM9) in Mhp1 (ref. 24). Thus, the moving and non-moving 
portions of GadC are exactly the reciprocal of those in Mhp1. The 
choice of the core domain in GadC is justified by the following analysis. 
Superposition between AdiC and GadC yields a root mean squared 
deviation of 2.98 Å over 167 aligned Cα atoms if TM3, TM4, TM5, 
TM8, TM9 and TM10 are treated as the non-moving portion, and 
about 3.95 Å over 143 aligned Cα atoms if TM1, TM2, TM5, TM6, 
TM7 and TM10 are treated as the non-moving portion. 
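
A comparison of this kind can be sketched with the Kabsch algorithm, which finds the rotation that minimizes the root mean squared deviation between two sets of paired Cα coordinates. The Python example below is illustrative only: the superposition program actually used is not stated here, `kabsch_rmsd` is a hypothetical helper, and the coordinates are random synthetic points, not atoms from AdiC or GadC.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between paired N x 3 coordinate sets after optimal superposition."""
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    # Remove translation by centring both sets on their centroids.
    Pc = P - P.mean(axis=0)
    Qc = Q - Q.mean(axis=0)
    # Covariance matrix and its singular value decomposition.
    H = Pc.T @ Qc
    U, _, Vt = np.linalg.svd(H)
    # Correct for a possible reflection so that R is a proper rotation.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    diff = (R @ Pc.T).T - Qc
    return float(np.sqrt((diff ** 2).sum() / len(P)))

# Synthetic check: a rotated and translated copy should superimpose exactly.
rng = np.random.default_rng(0)
P = rng.normal(size=(20, 3))
t = np.radians(35.0)
Rz = np.array([[np.cos(t), -np.sin(t), 0.0],
               [np.sin(t),  np.cos(t), 0.0],
               [0.0,        0.0,       1.0]])
Q = P @ Rz.T + np.array([1.0, 2.0, 3.0])
print(kabsch_rmsd(P, Q) < 1e-8)
```

To compare two candidate non-moving portions, one would pass the paired Cα coordinates of each candidate set of TMs in turn and treat the set with the lower deviation as the better choice of core.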

Structural elucidation of the Glu-GABA antiporter GadC is a step 
towards a detailed, mechanistic understanding of the amino acid 
antiporters. Many questions remain (see Supplementary Discussion). 
At present we have little information about how and to what extent the 
C-plug of GadC is dislodged during substrate transport. We have yet to 
explain the pH-dependent transport activity by amino acid antiporters 
such as GadC and AdiC. Conclusive answers to these questions require 
additional biochemical and structural investigation. 


METHODS SUMMARY 


All constructs were generated by standard PCR-based protocol. GadC was over- 
expressed in E. coli BL21(DE3). The proteins were purified to homogeneity by 
affinity chromatography and gel filtration. Crystals of GadC were grown by the 
hanging-drop vapour-diffusion method. The crystals belong to the space group 
P2₁2₁2₁; they were flash-frozen in a cold nitrogen stream at 100 K. X-ray data were 
collected at the Shanghai Synchrotron Radiation Facility (SSRF) beamline BL17U 
and SPring-8 beamline BL41XU. Data were processed with HKL-2000 (ref. 26). 
The platinum positions were determined with the program SHELXD”. Cross- 
crystal averaging with all three data sets combined with solvent flattening, 
histogram matching and non-crystallographic symmetry (NCS) averaging in 
DMMulti gave a map of sufficient quality for model building. An initial model 
was built into the experimental map by using COOT”. The sequence docking was 
aided with the selenium sites in the anomalous difference Fourier map. The 
structure was refined with PHENIX”. Proteoliposome assays were performed to 
determine transport activities of various GadC mutants. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Protein preparation. The cDNA of full-length GadC was subcloned into pET15b 
(Novagen). Overexpression of GadC was induced by 0.2 mM isopropyl 
β-D-thiogalactoside (IPTG) when the cell density reached a D₆₀₀ of 1.2. After growth at 
37 °C for 4 h, the cells were collected and homogenized in buffer containing 25 mM 
Tris-HCl pH 8.0 and 150 mM NaCl. After further disruption with a French press, cell 
debris was removed by low-speed centrifugation for 10 min. The supernatant was 
collected and ultracentrifuged for 1 h at 150,000g. The membrane fraction was 
collected and homogenized with buffer containing 25 mM Tris-HCl pH 8.0 and 
150 mM NaCl. n-Octyl-β-D-glucopyranoside (β-OG; Anatrace) was added to the 
membrane suspension to a final concentration of 2% (w/v) and then incubated for 2 h 
at 4 °C. After another ultracentrifugation step at 150,000g for 30 min, the supernatant 
was collected and loaded on Ni²⁺-nitrilotriacetate affinity resin (Ni-NTA; Qiagen), 
followed by a wash with 25 mM Tris-HCl pH 8.0, 500 mM NaCl, 20 mM imidazole, 
0.4% n-nonyl-β-D-maltopyranoside (NM; Anatrace). After proteolytic removal of 
the hexahistidine (His₆) tag on the column, GadC was eluted with buffer containing 
25 mM Tris-HCl pH 8.0, 150 mM NaCl and 0.4% NM. After concentration to 
10-15 mg ml⁻¹, GadC was further purified by gel filtration (Superdex-200 10/30; GE 
Healthcare). The buffer for gel filtration contained 25 mM Tris-HCl pH 8.0, 150 mM 
NaCl and various detergents. The peak fractions were collected. The GadC mutants 
were generated with a standard PCR-based strategy and were subcloned, 
overexpressed and purified in the same way as the WT protein. 

Crystallization. The hanging-drop vapour-diffusion method was performed at 
18 °C during crystallization. Crystals belonging to the space group P2₁2₁2₁ were 
obtained with protein purified in the presence of 0.2% n-nonyl-β-D-glucopyranoside 
(β-NG; Anatrace) and 0.023% n-dodecyl-N,N-dimethylamine-N-oxide (LDAO; 
Anatrace). The crystallization buffer was 21% PEG400, 100 mM Tris-HCl pH 8.0, 
100 mM NaCl and 325 mM sodium acetate. Rod-shaped crystals appeared 
overnight and typically grew to full size in about 1 week. Crystals were dehydrated by 
exposing the drops to air for 5 min. The best diffraction reached 2.95 Å at SSRF 
beamline BL17U. Platinum derivatives were obtained by soaking the crystals for 
48 h in mother liquor containing 10 mg ml⁻¹ K₂Pt(NO₂)₄. Seleno-L-methionine-incorporated 
crystals were also obtained and diffracted to a resolution similar to that of the 
heavy-atom-derivative crystals. Diffraction data for heavy-atom and selenomethionine 
derivatives were collected at SPring-8 beamline BL41XU. 

Data collection and structure determination. All anomalous diffraction data, 
including Pt-SAD and SeMet-SAD data, were collected at SPring-8 beamline 
BL41XU and processed with the package HKL-2000 (ref. 26) with routine procedures. 
The diffraction images from the severely anisotropic native crystal were collected at 
SSRF beamline BL17U and integrated with DENZO”. Before the .x files were 
input into SCALEPACK” for merging and scaling, anisotropic ellipsoidal 
truncation of the .x files was performed with the special version of the ellipsoidal 
truncation program provided by the University of California, Los Angeles, MBI 
Diffraction Anisotropy Server³¹. The applied resolution limits along the a*, b* and 
c* directions were 3.77, 3.31 and 3.10 Å, respectively, on the basis of the criterion 
of F/σ larger than 3.0. The pruned data were then used for structure determination 
and refinement. Further processing was carried out with programs from the CCP4 
suite³². Data collection statistics are summarized in Supplementary Table 1, and 
Supplementary Table 2 compares the data completeness before and after the 
anisotropic truncation. 
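
The idea behind an ellipsoidal cutoff can be sketched as follows: a reflection is retained only if its reciprocal-space vector falls inside an ellipsoid whose semi-axes are the reciprocals of the three directional resolution limits. The Python example below is a generic sketch and not the algorithm of the server itself; `within_ellipsoid` is a hypothetical helper, the cell is assumed to be orthorhombic, and the cell edges are made-up values chosen only for illustration. Only the three resolution limits are taken from the text.

```python
def within_ellipsoid(hkl, cell, limits):
    """Return True if reflection hkl lies inside the resolution ellipsoid.

    cell   -- (a, b, c) edge lengths in angstroms of an orthorhombic cell
    limits -- resolution limits in angstroms along a*, b* and c*
    """
    # For an orthorhombic cell the reciprocal-space components are h/a, k/b
    # and l/c, and the ellipsoid semi-axes are 1/limit along each direction.
    return sum((index * limit / edge) ** 2
               for index, edge, limit in zip(hkl, cell, limits)) <= 1.0

LIMITS = (3.77, 3.31, 3.10)      # angstroms, as stated in the text
CELL = (100.0, 100.0, 100.0)     # angstroms, hypothetical, illustration only

for hkl in [(26, 0, 0), (32, 0, 0), (0, 0, 32)]:
    print(hkl, within_ellipsoid(hkl, CELL, LIMITS))
```

With these assumed cell edges, a reflection at 3.125 Å is rejected along a* but retained along c*, which is the behaviour that distinguishes an ellipsoidal cutoff from a single spherical resolution limit.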

The platinum positions were determined with the program SHELXD³³. The identified 
platinum sites were then refined, and initial phases were calculated in the 
program PHASER³⁴. Cross-crystal averaging with all three data sets combined 
with solvent flattening, histogram matching and non-crystallographic symmetry 
(NCS) averaging in DMMulti³⁵ gave a map of sufficient quality for model building. 
An initial model was built into the experimental map with COOT”’. The sequence 
docking was aided with the selenium sites in the anomalous difference Fourier map. 
The structure was refined with PHENIX”. 


Preparation of oxidized protein. Oxidation of the WT GadC and the GadC 
mutant L267C/N364C was performed with o-phenanthroline copper complex’. 
The oxidation system comprised 25 mM MES buffer pH 5.5, 150 mM KCl, 
2.5 mg ml⁻¹ protein and 0.9 mM o-phenanthroline copper complex. The reaction was 
performed for 2 h on ice. 

Preparation of liposomes and proteoliposomes. Liposomes of E. coli polar lipid 
(Avanti) were prepared using a standard protocol as described previously³⁶. For the 
study of the effect of membrane potential on substrate transport by GadC, liposomes 
were loaded with 5 mM GABA and either 120 mM sodium phosphate pH 5.5 or 
potassium phosphate pH 5.5. For all other transport assays of GadC, liposomes 
were loaded with various choices of buffer system (25 mM) depending on the assay 
purposes, 150 mM KCl and 5 mM GABA (or other amino acids and their 
derivatives). For AdiC, 5 mM GABA was replaced with 5 mM agmatine or arginine. WT 
or mutant GadC or WT AdiC was incorporated into liposomes to form 
proteoliposomes by incubation with pre-extruded liposomes together with 1.25% β-OG 
(Anatrace) at a concentration of 5 µg mg⁻¹ lipids. β-OG was removed by 
incubation overnight with 400 mg ml⁻¹ Bio-Beads SM2 (Bio-Rad). The proteoliposomes 
were harvested by ultracentrifugation for 1 h at 150,000g and rinsed twice with 
resuspension buffer (various choices of buffer system (25 mM), 150 mM KCl). The 
proteoliposomes were resuspended in the resuspension buffer to a final lipid 
concentration of 100 mg ml⁻¹. 

In vitro transport assay. All transport assays were performed at 25 °C. For the 
study of the effect of membrane potential on substrate transport by GadC, the reaction 
was initiated by adding proteoliposomes (2 µl) to 100 µl of external buffer containing 
120 mM sodium phosphate pH 5.5 or potassium phosphate pH 5.5, 50 µM 
unlabelled L-glutamic acid and 1 µCi of L-[³H]glutamic acid (specific radioactivity 
51.1 Ci mmol⁻¹; PerkinElmer Life Sciences), with or without 1 µg ml⁻¹ valinomycin. 
For all other L-glutamic acid uptake assays of GadC, the reaction was initiated by 
adding proteoliposomes (2 µl) to 100 µl of external buffer containing 25 mM 
pH buffer, 150 mM KCl, 50 µM unlabelled L-glutamic acid and 1 µCi of 
L-[³H]glutamic acid. The final concentration of L-[³H]glutamic acid in the external 
buffer was about 0.19 µM. For AdiC, unlabelled and ³H-labelled L-glutamic acid 
were replaced by unlabelled and ³H-labelled L-arginine. The uptake of ³H-labelled 
substrate was stopped at the indicated time points by rapidly filtering the reaction 
solution through a 0.22-µm GSTF filter (Millipore), which was then washed with 2 ml of 
ice-cold wash buffer (25 mM glycine pH 9.5, 150 mM KCl). The filter was then taken 
for liquid scintillation counting. All experiments were repeated at least three times. 
The reactions lasted for various lengths of time depending on the different assay 
purposes. 
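
The stated tracer concentration can be checked from the amount of radioactivity, the specific radioactivity and the reaction volume. The short Python sketch below works through that arithmetic; the function name is hypothetical, and the reaction volume is assumed to be 102 µl (100 µl of external buffer plus 2 µl of proteoliposomes).

```python
def tracer_concentration_uM(activity_uCi, specific_activity_Ci_per_mmol, volume_ul):
    """Concentration (µM) of a radiolabelled tracer in a given reaction volume."""
    # Amount of tracer: Ci divided by Ci per mmol gives mmol.
    mmol = activity_uCi * 1e-6 / specific_activity_Ci_per_mmol
    mol = mmol * 1e-3
    litres = volume_ul * 1e-6
    return mol / litres * 1e6   # mol per litre converted to µM

# 1 µCi at 51.1 Ci per mmol in an assumed 102 µl reaction volume
print(round(tracer_concentration_uM(1.0, 51.1, 102.0), 2))  # prints 0.19
```

The result agrees with the value of about 0.19 µM quoted above; the tracer therefore adds less than 0.5% to the 50 µM of unlabelled L-glutamic acid.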

For determination of Vmax and Km, the same substrates were used on both sides 
of proteoliposomes (Glu/Glu, Gln/Gln, GABA/GABA for GadC; Arg/Arg for 
AdiC). The chosen time points were within the linear range of substrate accu- 
mulation. The preparation of proteoliposomes and the transport assay process 
were as described above. 
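
One common way to obtain Vmax and Km from initial rates measured within the linear range is to fit the Michaelis-Menten equation, v = Vmax·S/(Km + S). The fitting procedure used in this work is not described here, so the Python sketch below is only an assumption-laden illustration: it uses the Hanes-Woolf linearization (S/v = S/Vmax + Km/Vmax), the helper name is hypothetical, and the rates are synthetic, noise-free values generated from made-up parameters.

```python
import numpy as np

def fit_michaelis_menten(S, v):
    """Estimate (Vmax, Km) from substrate concentrations S and initial rates v.

    Uses the Hanes-Woolf form S/v = S/Vmax + Km/Vmax, so a straight-line fit
    of S/v against S has slope 1/Vmax and intercept Km/Vmax.
    """
    S = np.asarray(S, dtype=float)
    v = np.asarray(v, dtype=float)
    slope, intercept = np.polyfit(S, S / v, 1)
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km

# Synthetic data from hypothetical parameters (not measured values).
true_vmax, true_km = 2.0, 30.0
S = np.array([5.0, 10.0, 25.0, 50.0, 100.0, 250.0])
v = true_vmax * S / (true_km + S)

vmax, km = fit_michaelis_menten(S, v)
print(round(vmax, 3), round(km, 3))
```

For noisy experimental rates, direct nonlinear least-squares fitting of the Michaelis-Menten equation is generally preferred over linearized forms, because linearization distorts the error structure.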


31. Strong, M. et al. Toward the structural genomics of complexes: crystal structure of 
a PE/PPE protein complex from Mycobacterium tuberculosis. Proc. Natl Acad. Sci. 
USA 103, 8060-8065 (2006). 

32. Bailey, S. The Ccp4 suite—programs for protein crystallography. Acta 
Crystallogr. D Biol. Crystallogr. 50, 760-763 (1994). 

33. Schneider, T. R. & Sheldrick, G. M. Substructure solution with SHELXD. Acta 
Crystallogr. D Biol. Crystallogr. 58, 1772-1779 (2002). 

34. McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Cryst. 40, 658-674 
(2007). 

35. Cowtan, K. dm: an automated procedure for phase improvement by density 
modification. CCP4 ESF-EACBM News. Protein Crystallogr. 31, 34-38 (1994). 

36. Veenhoff, L.M. & Poolman, B. Substrate recognition at the cytoplasmic and 
extracellular binding site of the lactose transport protein of Streptococcus 
thermophilus. J. Biol. Chem. 274, 33244-33250 (1999). 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature10918 


Choice-specific sequences in parietal cortex during a virtual-navigation decision task 


Christopher D. Harvey'?**+, Philip Coen* & David W. Tank)?*4 


The posterior parietal cortex (PPC) has an important role in many cognitive behaviours; however, the neural circuit 
dynamics underlying PPC function are not well understood. Here we optically imaged the spatial and temporal activity 
patterns of neuronal populations in mice performing a PPC-dependent task that combined a perceptual decision and 
memory-guided navigation in a virtual environment. Individual neurons had transient activation staggered relative to 
one another in time, forming a sequence of neuronal activation spanning the entire length of a task trial. Distinct sequences 
of neurons were triggered on trials with opposite behavioural choices and defined divergent, choice-specific trajectories 
through a state space of neuronal population activity. Cells participating in the different sequences and at distinct time 
points in the task were anatomically intermixed over microcircuit length scales (<100 micrometres). During working 
memory decision tasks, the PPC may therefore perform computations through sequence-based circuit dynamics, rather 
than long-lived stable states, implemented using anatomically intermingled microcircuits. 


In real-world tasks, decision-making and working memory often occur 
in the context of other complex behaviours, including spatial naviga- 
tion. For example, when driving through a city towards a destination, 
sensory information defining context and place engages memory and 
decision circuits to plan turns at upcoming intersections. The PPC is a 
prime candidate for the neuronal circuitry combining the cognitive 
processing elements necessary for such tasks. In primates, the PPC is 
important for perceptual decision-making and categorization’, 
movement planning* and spatial attention’. Studies in rats suggest that 
the PPC is also important for encoding route progression during 
navigation®’. Using a virtual-reality system for mice’ and cellular 
resolution optical imaging methods'"’, we developed a T-maze-based 
navigation task combining all these cognitive elements and characterized 
the neuronal circuit dynamics in the PPC, which have not been studied 
in this combined behavioural context. 

Neuronal activity patterns in the PPC have been studied using 
microelectrode recordings during spatial attention, working memory 
and perceptual decision tasks. These studies have commonly found 
cells with sustained firing rate changes spanning entire task periods 
(cue, delay, response periods)***"’. For example, cells with persistent 
activity throughout the delay period of memory-guided saccade/reach 
tasks have frequently been recorded'*"’. Also, studies have identified 
neurons with ramps of increasing firing rate spanning the accumula- 
tion of evidence period in a motion perception task'*. Neurons with 
sustained activity during the same task period often have similar 
activity time courses, suggesting the presence of classes of cells (for 
example, delay cells) and implying that the task-dependent neuronal 
dynamics are low-dimensional. The low-dimensional dynamics can 
be reproduced in recurrent attractor network models, in which each 
cell’s activity is typically an amplitude-scaled version of a prototypical 
time series'**’. In contrast, recent analysis of prefrontal cortex activity 
has identified heterogeneous neuronal activity time courses, in which a 
neuron’s activity can be thought of as the sum of a few activity modes”. 


In addition, there is growing evidence for sequences of neuronal activa- 
tion within local circuits, in which each neuron is active for only a 
fraction of a task period, including during working memory tasks in 
the prefrontal cortex**** and the hippocampus” and during an object 
construction task in the PPC**. Sequences suggest dynamics that are 
high-dimensional, without the presence of classes of cells with relatively 
homogeneous activity time courses. 

We explored whether PPC dynamics are best described in terms of 
cell classes or high-dimensional dynamics. During a navigation-based 
decision task, the dynamics were high-dimensional: neurons were 
active in choice-specific sequences in which information moves from 
one neuronal population to another across time in the task; although the 
neuronal activity patterns could be divided into cue, delay and response 
groups, sequences were present within each group. Furthermore, 
exploiting the ability of cellular resolution optical measurements to 
provide the relative anatomical location of the recorded cells'”*’*”, we 
found that neurons active during behaviourally distinct task periods 
and on trials with different behavioural choices were spatially inter- 
mixed over microcircuit length scales. 


A PPC-dependent navigation-based decision task 


Using a virtual-reality system’®, we trained mice to navigate through a 
virtual T-maze in a task that incorporated both visual discrimination 
and a memory-guided response (Methods; Fig. 1a). Visual cues present 
in the initial section of the maze indicated which direction to turn at the 
T-intersection to receive a water reward (one set of cues to indicate a 
right turn and a second set for a left turn; Supplementary Fig. 1c). 
Between the cue section and the T-intersection, mice ran through a 
delay maze section that was identical on left and right turn trials; visual 
information about the reward location was thus present only in the cue 
section. The task resembled traditional delay tasks'*’°** in its cue- 
delay-response structure, but differed in that continuing sensorimotor 
activity was present throughout the task, including the delay period. 


¹Princeton Neuroscience Institute, Princeton University, Princeton, New Jersey 08544, USA. ²Bezos Center for Neural Circuit Dynamics, Princeton University, Princeton, New Jersey 08544, USA. ³Lewis-Sigler 
Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA. ⁴Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA. Present 
Figure 1 | A PPC-dependent decision task in virtual reality. a, Diagram of 
the two versions of the virtual T-maze that differed only in the cue period and 
the reward location. Patterns in the diagram reflect the patterns present on the 
virtual maze walls. b, Behavioural performance on individual training (grey) 
and imaging (black) sessions. c, Rotational velocity of the spherical treadmill 
about the vertical axis for view angle changes on correct right (red) and left 
(blue) trials, aligned to the cue offset and the turn onset. d, Behavioural 
performance on a memory-guided task from a after receiving no injections 
(grey), saline (open circles) or muscimol (black) bilaterally in the PPC. 
Connected dots are from individual mice across daily sessions (n = 3 mice). 
e, Same as in d except for a visually-guided task (Supplementary Fig. 1b; key in 
e applies to d also). 


Mice performed the task with high levels of accuracy (83 ± 9% correct, 
P < 0.0001 versus 50%, t-test; Fig. 1b). Within and across trials, individual 
mice ran at highly consistent speeds (Supplementary Fig. 2); 
however, running speeds varied between mice, and the maze position 
at which mice began rotating the spherical treadmill to make a right or 
left turn differed across trials (turn onset; Fig. 1c, Supplementary Fig. 3; 
Methods). To compare behaviourally similar epochs of the task across 
trials, individual trials were aligned to the time points when the cue was 
no longer visible (cue offset), the turn onset, and the end of the trial; 
these alignment points formed the boundaries of cue (trial start to cue 
offset, 5.1 ± 2.6 s), delay (cue offset to turn onset, 4.2 ± 1.2 s) and turn
task periods (turn onset to trial end, 3.0 ± 1.8 s).

Because the location of the mouse PPC has not been characterized, 
we first performed retrograde and anterograde labelling experiments 
to locate it anatomically (Methods, Supplementary Fig. 4). We iden- 
tified a region consistent with the rat and primate PPC based on the 
set of areas from where it received axonal projections, the areas to 
which it sent projections, and its location relative to other cortical 
regions (anterior to visual cortex and posterior to somatosensory
cortex). We therefore considered this area to be the mouse PPC. 

To test if the PPC was required for the behavioural task, we
inactivated it using bilateral injections of the GABAA receptor agonist
muscimol. Muscimol reversibly decreased behavioural performance
from high levels of accuracy in control sessions to near chance levels,
but did not affect the rate of trials performed (fraction correct: no
injections, 87 ± 7%; saline, 85 ± 5%; muscimol, 54 ± 5%, P < 0.0001
versus no injections, t-test; trials per minute: no injections, 3.2 ± 0.5;
muscimol, 3.1 ± 0.3, P > 0.8, t-test; Fig. 1d). In contrast, PPC
inactivation did not significantly affect performance on a visually guided
task in which a visual cue was present at the reward site and visible
throughout the trial, indicating that the decrease in performance on the
memory-guided task was unlikely to be due to a major visual or motor
deficit (fraction correct: no injections, 90 ± 5%; muscimol, 87 ± 5%,
P > 0.2, t-test; Fig. 1e, Supplementary Fig. 1b).


Imaging sequences of neuronal activity 


We used two-photon microscopy to image layer 2/3 PPC neurons
expressing the genetically encoded calcium indicator GCaMP3, which
increases in fluorescence intensity in response to action potential
firing (Methods). On average, we imaged ~65 cells simultaneously
within an area ~300 µm by ~150 µm (range, 37-94 cells). Nearly all
imaged cells showed statistically significant Ca²⁺ transients during
the behavioural session (96% of cells had >0.2 transients per minute;
Methods). Of the cells with high levels of activity (>2 Ca²⁺ transients
per minute on average; Supplementary Fig. 5), ~73% had significant
increases in their mean fluorescence intensity traces (ΔF/F, averaged
across trials) during a specific time in the trial or inter-trial interval
(task-modulated cells; Fig. 2, Supplementary Figs 6, 7). These
task-modulated cells had Ca²⁺ transients for only short time intervals on
individual trials (11 ± 8% of time points in trials with a transient)
such that only a small fraction of neurons was active simultaneously
(Fig. 2b-e, Supplementary Figs 8, 9a); cells with prolonged activity
patterns covering a large fraction of the trial were not observed. The
majority (~71%) of task-modulated cells had significantly different
levels of activity on correct right and left choice trials (choice-specific
cells). Similar choice-specific, task-modulated activity patterns were
observed in extracellular electrophysiological recordings
(Supplementary Figs 10, 11). Cells were also active on error trials, such that neurons
active during the cue period tended to be correlated with the cue
identity, and neurons active during the turn period in general were
correlated with the behavioural response (Supplementary Fig. 12).
Only a small fraction of cells had obvious reward-related signals
(~2% of active cells with P < 0.01, t-test, comparing ΔF/F values
within ~0.6 s after the reward was given on correct trials or missed
on error trials).

When the activity patterns of all the choice-specific, task-modulated 
cells were ordered according to the time profile of their Ca²⁺ transients,
the active periods across cells were staggered relative to one another in 
time, forming a sequence of neuronal activation covering the entire trial 
length (Fig. 2c, d, Supplementary Fig. 7c). Different sequences of 
neurons were activated on left and right trials (Fig. 2c, d and absence 
of activity in Fig. 2d lower plot, Supplementary Fig. 14e). Although 
these plots of sequences combined cells from different experiments 
and averaged across trials, similar properties were observed when 
considering only the cells imaged in a single mouse and on individual 
trials (Fig. 2c, Supplementary Fig. 13). Sequences were also apparent in 
the ~29% of task-modulated cells that did not have choice-specific 
activity; these cells participated in the sequences for both right and left 
choice trials (Supplementary Fig. 7b). In total, ~73% of the highly 
active cells participated in sequences. Sequences similar to those during 
the task were not observed in shuffled versions of the data set, demon- 
strating that the sequences were not an artefact created by ordering the 
data (Supplementary Fig. 14a-d). 
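
The ordering step described above can be illustrated with a minimal sketch. It is not the analysis code used in the study: it assumes that each cell's trial-averaged ΔF/F trace is available as a list of floats, and the cell names and values in `mean_traces` are invented for illustration. Each trace is normalized to its own peak and the cells are then sorted by the time of that peak.

```python
def sort_cells_by_peak(mean_traces):
    """Normalize each mean dF/F trace to its peak and order cells by peak time.

    mean_traces: dict mapping a (hypothetical) cell id to a list of
    trial-averaged dF/F values sampled at common time points.
    Returns a list of (cell_id, peak_index, normalized_trace), sorted by peak_index.
    """
    rows = []
    for cell_id, trace in mean_traces.items():
        peak = max(trace)
        peak_index = trace.index(peak)  # first time point at which the peak occurs
        # Guard against silent cells to avoid division by zero.
        normalized = [v / peak if peak > 0 else 0.0 for v in trace]
        rows.append((cell_id, peak_index, normalized))
    return sorted(rows, key=lambda row: row[1])


# Toy data (invented): three cells active at different times in the trial.
mean_traces = {
    "cell_a": [0.0, 0.1, 0.2, 0.8, 0.1],
    "cell_b": [0.5, 0.1, 0.0, 0.0, 0.0],
    "cell_c": [0.0, 0.4, 0.1, 0.0, 0.0],
}
ordered = sort_cells_by_peak(mean_traces)
order = [cell_id for cell_id, _, _ in ordered]
```

Plotting the normalized traces row by row in this order gives a staggered, diagonal band when activity is sequential. Sorting imposes some apparent ordering on any data set, which is why the comparison with shuffled data is needed.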

Because previous studies of the PPC have categorized cells into
classes with cue, delay or response period activity, we examined
the activity patterns to see if neurons in the sequence were grouped on
the basis of behavioural periods. The distribution of activity times of
all cells in the population (calculated for each cell as the
centre-of-mass (COM) in time of the mean ΔF/F during the trial, t_COM) had
three peaks corresponding to the cue, delay and turn periods,
suggesting a possible grouping by behavioural period (Fig. 3a). Consistently,
principal component analysis (PCA) of the mean ΔF/F traces for all
cells revealed three intermixed groups, with each group mostly con- 
taining cells preferring the same behavioural period (Fig. 3a, b, Sup- 
plementary Fig. 15). 
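
A centre-of-mass time of this kind can be computed as the activity-weighted mean of the sample times, sum(t_i × f_i) / sum(f_i). The sketch below is an illustration under stated assumptions rather than the authors' implementation: the function name and toy values are hypothetical, and clipping negative ΔF/F values to zero is an assumption of this sketch, not a detail taken from the text.

```python
def center_of_mass_time(times, mean_dff):
    """Activity-weighted mean time of a trial-averaged dF/F trace.

    Computes sum(t_i * f_i) / sum(f_i). Negative dF/F values are clipped to
    zero so that all weights are non-negative (an assumption of this sketch).
    """
    weights = [max(f, 0.0) for f in mean_dff]
    total = sum(weights)
    if total == 0:
        raise ValueError("trace has no positive activity")
    return sum(t * w for t, w in zip(times, weights)) / total


# Toy example (invented): samples at 0-4 s with activity centred on 3 s.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
mean_dff = [0.0, 0.0, 0.5, 1.0, 0.5]
t_com = center_of_mass_time(times, mean_dff)
```

A histogram of such values across cells is the kind of distribution in which peaks for the cue, delay and turn periods would appear.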

Although the population of neurons could be divided into groups, 
the temporal activity patterns within each individual period were 
heterogeneous and formed sequences (Fig. 3c). Cells within their preferred
period were active for only a fraction of the period (35 ± 16% of
time points in preferred periods with a Ca²⁺ transient), with different
cells active at different times. Although cells with activity covering a 
large fraction of the period were occasionally observed (for example, 
Fig. 2c top panel), these cells were rare (4% of cells with activity lasting
for >60% of the period; Supplementary Fig. 9b); the distribution of

Figure 2 | Imaging PPC neuronal activity during the T-maze task. a, Left:
example image of GCaMP3-expressing neurons in layer 2/3. Right: example
fluorescence intensity traces (ΔF/F; grey portions indicate significant Ca²⁺
transients, Methods) for three example cells from the left panel on correct right
(red) and left (blue) trials. b, Activity patterns during the task for cells 1-3 from
a. Top: colour-coded ΔF/F traces for individual correct left and right choice
trials. Each row is a single trial aligned to the cue offset, turn onset and trial end.
Bottom: mean ΔF/F traces for correct right (red) and left (blue) choice trials.
Dashed lines indicate mean ± s.e.m. c, Normalized mean ΔF/F traces for all the

epoch coverage by an individual neuron’s activity was similar during
the delay and cue or turn periods (P > 0.1, Kolmogorov-Smirnov test). 
Furthermore, Pearson’s correlations between the non-averaged ΔF/F
traces for cell pairs with the same trial-type and behavioural period 
preferences varied widely, with a large fraction of pairs having low 
correlation coefficients (Supplementary Fig. 16a). The low correlation 
coefficients could be due to activity at different times in the period, as 
expected for sequences, or activity at the same time in the trial except 
on different trials. However, the probability that both cells in these 
pairs had Ca²⁺ transients or that both cells did not have transients
on the same trial during their preferred epoch was generally high 
(Supplementary Fig. 16b), suggesting that the diversity primarily 
resulted from differences in the activity times of cells within trials. 
Together these data indicate that classes of cells with homogeneous 
activity patterns were not present. Rather, choice-specific sequences of 
neurons were activated in all behavioural periods, with a lower density 
of cells in the sequence at the borders between periods. 

To further examine the sequential neuronal activation on indi- 
vidual trials, we calculated correlations between the activity patterns 
of cells using the non-averaged ΔF/F time series. Cells that were active

choice-specific, task-modulated cells (one cell per row) imaged in a single
mouse and divided by left-preferring (n = 51) and right-preferring (n = 54)
cells. Traces were normalized to the peak of each cell’s mean ΔF/F trace on
preferred trials and sorted by the peak time. Some cells were imaged on 
different days and in different fields-of-view. d, Same as in c, except for all mice 
(n = 404 cells from 6 mice) on preferred and opposite trials. e, Fraction of active 
cells with significantly different activity levels on right and left choice trials as a 
function of trial time. 


at similar time points in the trial on average (measured as the difference
in t_COM values on correct trials, Δt_COM) had, on correct trials and
in their preferred behavioural periods, highly correlated ΔF/F traces
and peaks in their cross-correlation at a lag approximately equal to
Δt_COM (Supplementary Fig. 16d-h); these relationships in the
non-averaged ΔF/F time series provide further evidence for sequential
activity on individual trials. Cells that were active at similar times 
on correct trials were also highly correlated on error trials and in 
non-preferred periods (Supplementary Fig. 16c-f). Additionally, cells 
that were sequentially activated during their preferred behavioural 
period on correct trials were sequentially active with similar lags 
during error trials and other time points in the task (Supplementary 
Fig. 16g-i). 

The choice-specific activity could result if mice experienced differ- 
ent visual stimuli and running patterns on right and left trials and if 
PPC activity was modulated by those differences. To examine this, we 
first performed a multiple regression analysis to determine the poten- 
tial effects of the parameters defining the mouse’s running trajectory 
on the fluorescence changes during the delay period (Supplementary 
Table 1). These parameters could not explain the choice-specific

Figure 3 | Neuronal activity in individual behavioural periods. a, Histogram
of the times of the centre-of-mass of the mean ΔF/F trace (t_COM) for
choice-specific, task-modulated cells. Cells were separated into three groups (cue-,
delay- and turn-preferring cells; varying shades of green) based on peaks in the
distribution. b, PCA on the normalized mean ΔF/F traces for all the
choice-specific, task-modulated cells. Left, scores for each cell plotted for the first two
principal components (PC 1, 2). Right, cells categorized on the basis of
groupings from a. n = 404 cells. c, Sorted normalized mean ΔF/F traces for
cue-preferring (n = 101), delay-preferring (n = 133) and turn-preferring (n = 170)
cells, aligned to the trial start, cue offset and turn onset, respectively, on the
preferred trial-type.


activity patterns, suggesting that any differences in running trajectories 
between right and left trials did not trigger the activity we observed. In 
addition, we performed two sets of experiments to further examine 
whether the maze visual stimuli alone or the mouse’s running patterns 
triggered PPC activity. In the first experiments, movies of simulated left 
and right turn runs through the T-maze, which closely approximated 
real runs, were played to mice that passively viewed the visual scenes 
(open-loop experiments; Supplementary Fig. 17a, b). In the second 
experiments, mice were trained on a virtual linear track to perform a 
simple running back and forth task (run to one end for a reward, turn 
around, run to the other end for the next reward). The linear track had 
several visual patterns on the walls, all of which were the same as 
patterns in portions of the T-maze (Supplementary Fig. 18a, b). The 
task required similar running and turning behaviours to the T-maze, 
except that turns were not memory-guided based on visual cues and a 
delay period, and there was not a two-alternative forced-choice struc- 
ture. The overall levels of activity during the simulated T-maze runs or 
during the linear track task were much lower than when mice actively 
performed the T-maze task (Supplementary Figs 17c, d, 18c, d). Also, 
only a small fraction of neurons had significant increases in activity at 
specific locations in the maze, either during the simulated T-maze runs 
or in the linear track, suggesting that cells were not activated robustly 
by location-specific visual scenes or running patterns (simulated runs, 
1.3% of neurons; linear track, 5.8%; T-maze, 32.3%; Supplementary 
Figs 17e-k, 18e-k). Together these results suggest that PPC neurons in 
the T-maze were not activated only by the visual information or by the 
running patterns of the mouse. 


Choice-specific neuronal circuit trajectories 

The heterogeneous and sequential neuronal activity patterns during 
the T-maze task indicated that we should consider the dynamics of the 
population rather than classes of cells. We therefore analysed the 
dynamics as a trajectory through a state space of neuronal population 
activity (neuronal circuit trajectory). At each time point, the activity
state of the circuit containing n simultaneously imaged neurons was
defined as a point in an n-dimensional space, with each dimension
representing the activity (ΔF/F values) of a single neuron. Different
trajectories (visualized using factor analysis for dimensionality
reduction) were traversed for trials with different behavioural choices
(Fig. 4a, b, Supplementary Fig. 19a). The trajectories for correct right 
and left choice trials began the trial at similar positions, gradually 
diverged to reach a peak separation near the time of the behavioural 
choice, and converged to the starting point in the inter-trial interval 
(Fig. 4a-d). To quantify the trajectory divergence, we used a classifier 
based on the distance from an individual trial trajectory to the mean 
right and left choice trajectories at single time points. It was possible, 
from the activity of a small population of neurons located in close 
anatomical proximity (~65 neurons separated by <250 µm), to predict
the mouse’s choice on single correct trials at better than chance
levels during the cue, delay and turn periods (Methods; Fig. 4e, 
Supplementary Fig. 19b, c). The activity in the PPC can therefore be 
considered as divergent, choice-specific trajectories through a state 
space of neuronal population activity. 
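
A distance-based classification of this kind can be sketched as a nearest-mean rule applied at a single time point. The code below is a simplified illustration, not the classifier described in the Methods: the function names and toy activity values are hypothetical, and the held-out trial is assumed to be excluded from the means it is compared against.

```python
import math


def mean_state(states):
    """Element-wise mean of a list of equal-length population activity vectors."""
    n = len(states)
    return [sum(column) / n for column in zip(*states)]


def classify_choice(state, mean_right, mean_left):
    """Assign a single-trial state to the nearer mean trajectory point.

    Uses Euclidean distance in the n-dimensional activity space; ties are
    assigned to "left" (an arbitrary choice made for this sketch).
    """
    if math.dist(state, mean_right) < math.dist(state, mean_left):
        return "right"
    return "left"


# Toy population of three neurons at one time point (values invented).
right_trials = [[1.0, 0.1, 0.0], [0.8, 0.0, 0.1]]
left_trials = [[0.0, 0.9, 0.1], [0.1, 1.1, 0.0]]
mean_right = mean_state(right_trials)
mean_left = mean_state(left_trials)

held_out = [0.9, 0.2, 0.0]  # a trial not used to compute the means
prediction = classify_choice(held_out, mean_right, mean_left)
```

Repeating the comparison at each time point, and scoring predictions against the mouse's actual choice, would give an accuracy curve over the course of the trial.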

Trajectories were highly variable on error trials. Some trajectories 
began close to the correct choice trajectory during the cue period and

Figure 4 | Neuronal circuit trajectories on correct and error trials. a, Time
course of mean choice-specific trajectories on correct right (red) and left (blue)
choice trials from one session (plotted for the first three common factors).
Points labelled 1, 2 and 3 correspond to the mean times of the cue offset, turn
onset and trial end, respectively. b, Example individual (thin lines) and mean
(thick lines) trajectories for correct trials from the session in a. c, Euclidean
distance between the mean trajectories on correct right and left choice trials
(n = 29 individual sessions, grey). The black line indicates the mean.
d, Trajectory selectivity index for individual correct trials from a single session,
defined on the basis of distances as
$(d_{\text{to mean traj, opposite choice}} - d_{\text{to mean traj, same choice}}) / (d_{\text{to mean traj, same choice}} + d_{\text{to mean traj, opposite choice}})$.
Values close to 1 and −1 indicate that the individual trial trajectory was near the mean trajectory of
the same and the opposite behavioural choice, respectively. Mean trajectories
were defined using correct trials only. Grey and black lines indicate individual
trials and the mean, respectively. e, Classification accuracy for determining the
behavioural choice of the mouse at different time points in the task during
individual sessions (grey, n = 29; black, mean). The classifier was based on a
distance-dependent classification scheme (see Methods) using correct trials.
f, Example individual trial trajectories (grey and black) on right choice and left
choice error trials, plotted with the mean trajectories for correct right (red) and
left (blue) choice trials. g, Same as for d, except on error trials. h, Same as for
e, except on error trials.

transitioned towards the error choice trajectory later in the trial; such
transitions occurred at a wide range of points in the trial (grey traces in 
Fig. 4f, Supplementary Fig. 19f). Other trajectories were similar to the 
error choice trajectory throughout the trial (Fig. 4f, black traces). On 
average, the error trial trajectories were more similar to the correct 
choice trajectory during the cue period and closer to the error choice 
trajectory during the turn period (cue offset: trajectory selectivity
index = −0.13 ± 0.22, P < 0.001 versus 0, t-test; trial end: trajectory
selectivity index = 0.18 ± 0.15, P < 0.001 versus 0; Fig. 4g; consistent
results based on classification, Fig. 4h). Therefore, individual trajectories 
transitioned between the mean correct right and left choice trajectories 
at many time points during trials, but most frequently switched during 
the delay period. 
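
The trajectory selectivity index is the difference between the distances to the opposite-choice and same-choice mean trajectories, divided by their sum. The sketch below shows that calculation for one time point. It assumes Euclidean distance in the activity space, and the function name and two-neuron toy states are hypothetical.

```python
import math


def trajectory_selectivity_index(state, mean_same, mean_opposite):
    """(d_opposite - d_same) / (d_same + d_opposite) at a single time point.

    Returns +1 when the trial state lies on the same-choice mean trajectory
    and -1 when it lies on the opposite-choice mean trajectory.
    """
    d_same = math.dist(state, mean_same)
    d_opposite = math.dist(state, mean_opposite)
    if d_same + d_opposite == 0:
        raise ValueError("state coincides with both mean trajectories")
    return (d_opposite - d_same) / (d_same + d_opposite)


# Toy two-neuron state space (values invented for illustration).
mean_same = [1.0, 0.0]
mean_opposite = [0.0, 1.0]
on_same = trajectory_selectivity_index([1.0, 0.0], mean_same, mean_opposite)
midway = trajectory_selectivity_index([0.5, 0.5], mean_same, mean_opposite)
on_opposite = trajectory_selectivity_index([0.0, 1.0], mean_same, mean_opposite)
```

On this reading, a sign change in the index over the course of a trial corresponds to the trajectory moving from the neighbourhood of one mean trajectory to the other.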


Anatomical micro-organization of PPC dynamics 

To determine how the cells implementing the activity dynamics were 
anatomically organized, we first examined the neuronal activity patterns 
to see if cells with different response preferences, such as activity peaks at
different times in the trial or different preferences for right and left
choice trials, were present in the same regions of the PPC or separated
into different areas. Each ~250 µm by ~125 µm area (that is,
field-of-view with simultaneously imaged cells) contained both right and left
choice-preferring cells of approximately equal numbers and cells with
activity peaks at a wide range of times in the trial (Fig. 5a-c,
Supplementary Fig. 20). Next, within each imaged area we compared the
activities of pairs of neurons as a function of the distance between the 
neurons’ cell bodies. The difference in the trial-type selectivity for cells 
in a pair did not depend significantly on the distance between cells, 
indicating that left and right choice-preferring cells were intermixed 
(ρ = 0.04, Spearman’s correlation, P > 0.05; Fig. 5e). Similarly, Δt_COM

Figure 5 | Anatomical micro-organization in the PPC. a, Example field-of-view
with cells outlined and choice-specific, task-modulated cells coloured.
Left, cells’ trial-type selectivity, defined as
$(\Delta F/F_{\text{right trials}} - \Delta F/F_{\text{left trials}}) / (\Delta F/F_{\text{right trials}} + \Delta F/F_{\text{left trials}})$.
Values close to 1 (red) and −1 (blue) indicate right
and left choice preferences, respectively. Right, cells’ time of the centre-of-mass
of the mean ΔF/F trace (t_COM). t_COM = 0 corresponds to ~3 s before the cue
offset. b, Box plots of t_COM values for task-modulated cells in individual
fields-of-view (box edges, first and third quartiles; vertical line in the box, median;
whiskers, range). c, Number of right and left choice-preferring cells in each
field-of-view (n = 29; unity line is shown). d, Difference in t_COM as a function of the
distance between cells. e, Difference in the trial-type selectivity index as a
function of the distance between cells. f, Pearson’s correlation between
non-averaged ΔF/F traces (all time points) for all pairs of active cells (black) and cell
pairs with the same trial-type and behavioural period preference (grey) as a
function of the cell-cell distance.

for a cell pair did not differ with the distance between the cells,
indicating that cells active during different periods of the task were
intermingled (ρ = −0.01, P > 0.6; Fig. 5d). However, cells that were
separated by less anatomical distance had ΔF/F traces that were
significantly more correlated than cell pairs further apart, but the
relationship between the correlation coefficient and distance was weak (all
pairs of active cells: ρ = −0.16, P < 0.001; pairs with the same
trial-type and behavioural period preference: ρ = −0.22, P < 0.001; Fig. 5f).
This weak relationship could be due to overlapping fluorescence 
changes, such as from dendritic signals, or could reflect an actual, weak 
spatial organization. Regardless, cells with highly different activity 
patterns were intermixed over short length scales, and an anatomical 
separation of the response properties we measured was not present. 
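
The pairwise comparisons above relate a property difference for each cell pair to the distance between the two cell bodies using Spearman's rank correlation. The following sketch shows one way to set that up with the standard library only. The cell positions and t_COM values are invented, the helper names are hypothetical, and the sketch reports ρ only (no P value).

```python
import math
from itertools import combinations


def ranks(values):
    """Average ranks (1-based); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical cells: (x, y) soma position in micrometres and t_COM in seconds.
cells = [((0, 0), 1.0), ((10, 0), 4.0), ((20, 0), 2.0), ((30, 0), 5.0)]
distances, delta_t = [], []
for (pos_a, t_a), (pos_b, t_b) in combinations(cells, 2):
    distances.append(math.dist(pos_a, pos_b))
    delta_t.append(abs(t_a - t_b))
rho = spearman(distances, delta_t)
```

A value of ρ near zero, as reported for Δt_COM and for trial-type selectivity, indicates no monotonic dependence of the pairwise difference on distance.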


Discussion 


The choice-specific sequences of PPC neuronal activation we report here 
add to the growing list of studies that have identified cortical sequences 
of activity states during working memory tasks. Furthermore,
because we demonstrated that PPC activity was necessary only for the 
memory-guided task, the sequences of activation were probably import- 
ant at least for the memory aspect of the task. Sequence-based dynamics 
may therefore be a common framework for circuit function during 
memory and decision tasks, including during navigation behaviours. 
Such dynamics could potentially be implemented using feedforward 
architectures or liquid state machines related to those that have
been proposed for working memory. 

Our results also offer a way to unite previous work on neural coding 
in the PPC. Navigation, memory and choice information may be combined
in the sequences such that the identity of the active sequence
reflects choice-related information for working memory and movement
planning, and that the currently active cell within the sequence
reflects spatial or temporal progression through the task, which were
highly correlated in our task (Supplementary Fig. 2). It seems unlikely
that PPC neurons only provided location information in a
context-dependent manner, like hippocampal place cells, because unlike
place cells PPC neurons did not encode spatial location during a linear
track task (Supplementary Fig. 18) or other tasks.

A possible explanation for heterogeneity and sequences in our 
experiments versus stereotypy and low-dimensional dynamics, which 
have been emphasized in previous recordings and models of PPC 
activity, is that the PPC adopts different dynamics depending
on the demands of the behavioural task. Because traditional delayed 
saccade tasks, for example, have one spatiotemporal component 
during the delay period (fixation before making a response), the 
PPC may adopt sustained activity patterns. In contrast, during tasks 
that involve many spatial and temporal components, as are common 
in natural behaviours and during navigation, the PPC may utilize 
sequences of activation. Alternatively, sequences of activity may be 
present in the primate PPC during traditional tasks but have yet to be 
identified, consistent with emerging evidence for heterogeneous
temporal response properties. In addition, different regions or
layers of the PPC may have differing activity dynamics, or the
dynamics of rodent and primate PPC circuits may differ. 

Because cells that were active at distinct time points in the task and 
that participated in different choice-specific sequences were spatially 
intermingled, our results indicate that functionally distinct sub- 
networks are anatomically interlaced in the PPC. This extends pre- 
vious work in sensory cortex, motor cortex and the hippocampus 
showing spatial intermixing of heterogeneous response properties in 
cells encoding qualitatively similar types of information (for example, 
orientation selectivity in visual cortex) or in cells with activity during 
similar task epochs. Our findings differ from the predictions of
models that have emphasized the spatial clustering of similar response
patterns, as in functional columns, and that propose connectivity
defined by axonal-dendritic overlap without fine-scale specificity.
Rather, our results support a model in which microcircuits are formed
by highly specific synaptic connectivity and are composed of neuronal
motifs, such as those identified in brain slice recordings in sensory
cortices and amongst visual cortical neurons with the same
orientation selectivity. Our results showing that cell pairs that were
sequentially active during their preferred periods on correct trials 
had similar activity relationships even during error trials and the 
inter-trial interval suggest the presence of such motifs (Supplemen- 
tary Fig. 16g-i). 

The behavioural task used here did not isolate the decision-making 
process. We note however that activity trajectories occasionally 
switched during a trial between the prototypical correct left and right 
choice trajectories, including frequently on error trials, suggesting 
that the mouse’s decision was not necessarily irreversibly reached 
immediately after a trial’s start (trajectory selectivity switch during 
the delay period or last 1 s of the cue period; 63% of error trials, 20% of 
correct trials; Fig. 4d, g, Supplementary Fig. 19e, f). Sequences may 
therefore play a role in some aspect of decision-related processes, but 
further experiments will be necessary to assess this possibility. 

Our results motivate consideration of a conceptual framework for 
decision-making and working memory in which sensory information 
used for the decision activates a neuronal sequence of activity. The 
sequence begins in a choice-independent state, which could be 
mediated by neurons that are not choice-specific (Supplementary 
Fig. 7b), and then moves towards a choice-specific trajectory and away 
from other trajectories in a manner dependent on the incoming 
information. A decision is proposed to be reached when the sequence 
of activity intersects a choice-specific trajectory; different decisions 
involve intersections with different trajectories. Upon reaching a 
decision, a working memory can be maintained by continuing along 
that choice-specific trajectory. Changing decisions would occur through 
transitions between trajectories, but as time progresses in the task, the 
state space distance between trajectories increases, in effect creating a 
larger barrier to change. In this view, decision-making and working 
memory utilize an ordered progression through a sequence in which 
information moves from one population of neurons to another over 
time. The framework we propose is an extension of a point-of-view first 
considered in describing the dynamics underlying behavioural choices 
in the leech nervous system. It has some similarities with (and some
differences from) drift/diffusion-to-bound, race, and recurrent network 
models of decision-making implemented as neuronal integrator 
winner-take-all circuits. These models are similar to the
trajectory-based view in that different decisions correspond to a divergence in state
space surrounding a separatrix. However, these models differ from the 
sequence framework in that decisions are reached when the activity 
approaches a choice-specific fixed point with working memory 
maintained as stable activity at that point. Thus, although these 
circuits can demonstrate divergent trajectories to reach the fixed points 
associated with different choices, the trajectories are defined by 
relatively homogeneous changes in the activity of the population, 
and the same set of neurons participate in all stages of the decision- 
making and working memory process. 


METHODS SUMMARY 


Using a previously described virtual-reality system, male C57/BL6 mice were
trained using operant conditioning to navigate through a virtual T-maze to 
receive water rewards. Translation and rotation in the virtual environment were 
controlled by the mouse’s running on a spherical treadmill. Training was 
performed using shaping implemented as a set of six mazes of increasing task 
difficulty. Retrograde tracing was performed using fluorescent beads, and 
anterograde tracing was performed following injections of adeno-associated virus 
(AAV) containing GFP or GCaMP3. Muscimol injections for PPC inactivation 
were made bilaterally ~350 µm beneath the dura (50 nl, 1 ng nl⁻¹). Imaging was
performed using a custom two-photon microscope incorporated with the
virtual-reality system, as described previously. Imaging occurred at 2-6 weeks after
injection of AAV2/1-synapsin-1-GCaMP3 virus. A complete description of
the experimental methods and data analysis is available in the Supplementary 
Information.


CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature10976 


Corrigendum: Functional 
complementation between FADD 
and RIP1 in embryos and 
lymphocytes 


Haibing Zhang, Xiaohui Zhou, Thomas McQuade, Jinghe Li, 
Francis Ka-Ming Chan & Jianke Zhang 


Nature 471, 373-376 (2011) 


In Fig. 3b of this Letter, the plot for the NP205 epitope response in the 
Fadd−/−Rip1−/− double-knockout (DKO) splenocytes was a duplicate
of that for the NP38 epitope response. The correct figure panel is 
shown below. The panel has been corrected in the HTML and PDF 
versions online. 


[Figure panel: NP205 epitope response; axis label: IFNγ]


Figure 1 | This is the correct panel from Fig. 3b of the original Letter.


Influenza’s tale of tails


Epigenetics is a hot new research field, but it seems that the influenza virus already has it figured out. By mimicking 
epigenetic regulation in human cells, one flu strain suppresses the expression of antiviral genes. 


ALEXEI L. KRASNOSELSKY 
& MICHAEL G. KATZE 


severe, or even deadly, but most people 

recover within a few days, helped by their 
body’s immune system. Yet this apparently 
simple interaction between the virus and its 
host is deceptive. Influenza viruses are, in fact, 
masters at circumventing their hosts’ defences 
— they co-opt cellular protein-synthesis 
pathways to produce viral proteins, for exam- 
ple, and antagonize aspects of the immune 
response. In a paper published on Nature’s 
website today, Marazzi et al. describe another
viral evasion tactic. They show that an influ- 
enza protein called NS1 mimics a host-cell 
histone protein that is involved in regulating 
gene expression. By hijacking this regulatory 
machinery, the virus inhibits the cell's produc- 
tion of antiviral proteins. 

Cells tightly regulate the expression of 
specific genes according to an organism’s 
developmental stage and physiological state, 
and in response to environmental stimuli such 
as infection. Gene expression can be altered 
in various ways, including by transcription- 
factor proteins, which bind to DNA and 
regulate the transcription of certain genes, 
or through epigenetic modification — herit- 
able chemical changes to DNA and associated 
proteins that do not affect the DNA sequence 
but alter the rate of transcription. Epigenetics 
has rapidly become a much-studied topic 
since these regulatory mechanisms were first 
reported around 20 years ago. 

A common epigenetic alteration is the 
addition of methyl or acetyl groups to histone 
proteins, which are the ‘building blocks’ of 
chromatin, the scaffold that packages DNA 
into chromosomes. The four main histone 
proteins (H2A, H2B, H3 and H4) consist of a 
globular domain and an unstructured amino- 
terminal tail region that is dominated by the 
positively charged amino acids lysine and 
arginine. These amino acids can be modi- 
fied by enzymes that add or remove methyl 
or acetyl groups”. Such modifications alter the 
histone—DNA interaction, which in turn leads 
to dynamic changes in chromatin that allow 
specific areas of the chromatin to unfold, or 
other proteins to be recruited to the site. Thus, 
by regulating the accessibility of a particular 
region of DNA, epigenetic modifications can 
regulate gene expression. 

Figure 1 | Interfering influenza. Histones are the building-block proteins around which DNA is wound 
to form chromatin. Epigenetic changes to histones, such as the addition of methyl groups (blue circles), can 
affect the rate of gene transcription at specific DNA sequences. a, The amino-terminal (NH2) ‘tail’ of the 
human H3 histone protein contains a string of amino acids (ARTK) that can be methylated at the lysine (K) 
residue by a methyltransferase enzyme, Set1. This enzyme is recruited by a protein complex called PAF1C, 
which also promotes elongation of RNA molecules during gene transcription. Methylation of the lysine in 
ARTK of the H3 histone tail is associated with active gene transcription. b, Marazzi and colleagues¹ show that 
the NS1 protein of the H3N2 influenza virus contains an amino-acid sequence (ARSK, close to the protein's 
carboxy (COOH) terminus) that mimics the H3 ARTK sequence. Using this sequence, viral NS1 can bind 
directly to PAF1C, which might direct Set1’s methylation activity to NS1 rather than to the H3 histone. The 
authors also show that NS1 is deposited at the promoter regions of the host's DNA to which methylated H3K4 
would typically bind, and that this interference suppresses the expression of antiviral genes. 

Some epigenetic changes are generally asso- 
ciated with a site of active gene transcription’. 
An example of such a change is methylation 
at the lysine residue (K) of the amino acid 
sequence designated ARTK in histone H3 
(known as the H3K4 modification). Marazzi 
et al.¹ found that the H3N2 influenza virus 
contains an amino-acid sequence (ARSK) 
very similar to the histone’s ARTK sequence 
(Fig. 1). This mimic sequence is found in the 
virus’s NS1 protein, which is not essential for 
viral structure but is known to have other roles 
in evading the host's immune system*. The 
authors also show that this sequence similar- 
ity is functional — the NS1 tail can serve as 
a substrate for the histone-modifying enzyme 
Set1, a lysine methyltransferase. 

Marazzi and colleagues also demonstrate 
that H3N2’s histone-mimic tail binds directly 
to the transcription-elongation complex 
PAF1C (Fig. 1). PAF1C accompanies the 
transcription enzyme RNA polymerase II during 
the formation of messenger RNA, and is 
thought to recruit the Set1 enzyme that leads 
to H3K4 methylation’. The authors propose 
that NS1 binding to PAF1C interferes with 
gene transcription in the host cell. Indeed, the 
researchers found that when cells from human 
lung tissue were infected with the H3N2 
virus, transcription of rapidly inducible genes 
was halted. But when the cells were infected 
with H3N2 viruses that had been mutated 
so that they could no longer bind PAF1C, 
transcription of these genes was unaffected. 
Furthermore, the authors provide compelling 
evidence that it is specifically the transcrip- 
tion of antiviral genes that is interfered with 
by the interaction between NS1’s histone-like 
tail and PAF1C. Thus, histone mimicry 
seems to support viral infection, and may 
give a selective advantage to viruses that have 
this ability. 

Is this mechanism unique to H3N2 viruses, 
and could it be a recent evolutionary adap- 
tation? Moreover, does possession of this 
mechanism have a role in the variation in 
infectivity and disease severity that is seen 
among influenza strains? The H3N2 virus is 
an emerging influenza subtype that has caused 
only a limited number of human infections 
during the 2011-12 flu season. However, 
they seem to cause illness of around the same 
severity as that induced by standard seasonal 
influenza viruses, which do not contain the 
NS1 histone-mimic tail’. It will be necessary 
to assess other emerging influenza strains for 
histone mimicry to determine whether this 
mechanism is evolutionarily new, and to 
study more influenza cases to predict its 
potential impact on public health. Even if his- 
tone mimicry is not widespread and does not 
dramatically alter disease severity, Marazzi 
and colleagues’ discovery may provide a novel 
target for antiviral drugs aimed at modulating 
the host’s antiviral response. 

The finding raises another intriguing 
question: what would happen if a person is 
concurrently infected with H3N2 virus and 
a more virulent influenza virus, such as the 
highly pathogenic avian H5N1? Or what 
might occur if an H5N1 virus acquires the NS1 
sequences required for the protein to function 
as a histone mimic? The combined effect of 
a suppressed antiviral response and a highly 
virulent virus might lead to devastating consequences 
for the host. Alternatively, if an overly 
active host response is itself the main culprit 
in causing severe influenza, as has been sug- 
gested®, a suppressed antiviral response could 
actually reduce disease pathology. 

Research aimed at addressing these 
questions may be hampered by an increas- 
ing wariness about generating recombinant 
H5N1 viruses’. But ongoing influenza research 
remains essential. Although epigenetic gene 
regulation is new to scientists, other mecha- 
nisms of gene regulation no doubt remain to be 
discovered, and we can be certain that viruses 
have already evolved to interfere with these 
regulatory processes, and to use them to their 
advantage. 

Historical epochs can be defined by materials: from 
the Stone Age, Bronze Age, and Iron Age to the 
modern eras of steel and, for the last 60 years, 
silicon. These are the stuff that defined whole civilizations. 
And we may be on the threshold of a time dominated by one 
of the most common of all elements: carbon. 

The 2010 Nobel Prize in Physics went to scientists who 
revealed the remarkable properties of graphene, a single 
layer of carbon in which the atoms are arranged in hexagons. 
Graphene is the latest allotrope of carbon to be under the 
spotlight, following from the fullerenes (‘buckyballs’) of the 
1980s and the nanotubes of the 1990s. 

Talk to some of the scientists studying graphene and 
they can find it hard to contain their excitement — it’s as 
if they've been handed a new building block from which 
to engineer the world. Electric charge moves through 
graphene impossibly fast. It’s extremely strong, yet flexible. 
Add to that its impressive optical behaviour — it’s equally 
transparent to ultraviolet, visible and infrared light — 
and you whet the appetites of those developing diverse 
technologies, including computer chips, touch screens 
and solar cells (page S38). On the downside, however, it is 
difficult to produce graphene in pieces large enough for 
practical purposes (page S32). 

Although graphene cannot directly replace silicon in 
the digital switches that comprise computers (page S34), 
comparing it to silicon (page S43) is a useful analysis, and 
serves as a humbling reminder of how tricky it is to predict 
what any new material’s greatest impact will be. 

Super carbon 


Graphene is phenomenally strong, thin, flexible, 
transparent and conductive — and applications beckon. 


BY NEIL SAVAGE 


One day in 2004, Andre Geim and Konstantin 
Novoselov, two physicists at 
the University of Manchester in the 
United Kingdom, peeled down graphite to a 
speck of graphene and ran electrons through 
it. In doing so they opened up a world of possibilities 
based on this material’s remarkable 
properties, and won themselves the 2010 
Nobel Prize in Physics. 
Though only recently identified as a distinct 
form of carbon, graphene is not exotic. Layers 
of it stacked together form graphite, the ‘lead’ 


in pencils. Graphene rolled up becomes a car- 
bon nanotube. Wrapped into a sphere, it’s a 
fullerene or, colloquially, a ‘buckyball’. 

Unlike other carbon forms, graphene is a 
two-dimensional material, and the first exam- 
ple of such a thing in the real world. Philip Rus- 
sell Wallace, a theoretical physicist at McGill 
University in Montreal, Canada, predicted the 
electronic structure of graphene back in 1947, 
but many scientists believed 2D crystals were 
too unstable to exist. What Geim and Novoselov 
managed to pull off, through their savvy 
use of sticky tape, was a sheet of carbon a few 
micrometres across and just a single atom 
thick. This was possible because in graphite the atomic 
bonds between the layers are much weaker 
than the bonds within each layer. 

Graphene’s arrangement gives rise to some 
astonishing properties. Electrons move through 
it as if they have no mass. The material boasts an 
electron mobility (a measure of how fast charges 
travel) 100 times that of silicon. At room tem- 
perature, electrons can travel for several micro- 
metres through graphene without scattering 
— an order of magnitude further than in any 
other material. And researchers have demon- 
strated graphene transistors operating at more 
than twice the speed of the best silicon transis- 
tors of similar size. The behaviour of electrons 
in graphene also provides physicists studying 
quantum mechanics with a new research tool. 
Instead of having to use superconductors or flu- 
ids cooled to extremely low temperatures, they 
can observe the seemingly massless activity of 
electrons on their lab benches. 

Graphene stands out in other ways as well. 
Although it is the thinnest material known to 
exist, it is also the strongest ever measured: 100 
times stronger than steel. It conducts heat bet- 
ter than the previous champion, diamond. The 
atoms in graphene are packed so tightly that 
not even the smallest atom — helium — can 
pass through it. Yet graphene also stretches 
easily, giving it the ability to bend where other 
hardy materials would snap. 

One property that graphene lacks is a band- 
gap — a range of energy states where electrons 
cannot exist. Bandgap is a crucial character- 
istic that enables semiconductor transistors 
to switch between on and off states to form 
the zeroes and ones of digital technology. But 
what’s a problem for electronics is a boon for 
optics: graphene’s lack of a bandgap means the 
material absorbs light at roughly the same level 
across the spectrum, from ultraviolet far into 
infrared. Exploiting graphene’s optical and 
electronic abilities along with its strength and 
flexibility could lead to foldable plastic smart- 
phones, cheaper solar cells, or sensors that can 
detect single molecules of gas or identify indi- 
vidual DNA bases. 

Despite the intense research interest in 
graphene, real-world applications are in their 
infancy. It’s not yet clear if it will be able to sup- 
plant silicon in transistors any time soon (see 
‘Back to analogue’, page S34). And although 
carbon is ubiquitous — simply heating any 
organic material to high temperature in a 
vacuum produces plenty — engineers are still 
grappling with ways to make high-quality gra- 
phene in sufficient quantities and at a cost to 
sustain commercial applications (see ‘Beyond 
sticky tape’, page S32). Silicon took decades to 
find its signature role in technology; graphene’s 
journey has only just begun. 


Neil Savage is a freelance writer based in 
Lowell, Massachusetts. 

BATTLE OF THE ALLOTROPES 
For the past 25 years, researchers have been exploring the novel properties of one form of single-layer carbon after another. 

SUPER-MATERIAL 
Graphene stands out for its superlative mechanical, thermal and electronic properties. 

EXPLOSIVE GROWTH 
As research on graphene takes off, technology development is heavily concentrated in the United States and South Korea. 

PRODUCTION 


Beyond sticky tape 


Flecks of graphene are easy to make. But producing sheets 
of pristine, electronics-quality material is another matter. 


BY RICHARD VAN NOORDEN 


Unlike many hi-tech materials, graphene is 
so simple to make that anyone can produce 
it at home. First, press adhesive tape 
onto a chunk of graphite and pull: this peels off a 
thin flake of grey-black carbon. Then repeatedly 
stick the carbon-covered tape against itself and 
peel away: the carbon flake breaks up further into 
thin, faint fragments, each hundreds of microme- 
tres across. After a few rounds of this, some flakes 
have been whittled away to single-atom thickness: 
a fleck of graphene. 

All in all, it is a pleasingly artisanal way to 
make a remarkable material. The graphene is 
high quality, too: it was on similar products that 
Manchester University physicists Andre Geim 
and Kostya Novoselov measured extraordinary 
electron-transport properties in 2004 — experi- 
ments for which they won the 2010 Nobel Prize 
in Physics. “You can make several square millimetres 
with a couple of hours’ work,” says Peter 
Blake, cofounder of Manchester-based Graphene 
Industries, which sells such flakes to researchers 
for £500-1000 (US$780-1550) each. 

But one can't mass produce graphene with 
sticky tape. To do this, researchers use chemicals 
to split graphite into graphene platelets by the 
tonne. Unfortunately, with speed and volume 
comes variable quality: few of these platelets are 


perfect single sheets. The platelets are typically 
rough stacks of 5 to 10 carbon layers riddled with 
defects, and begrimed with other chemicals. Yet 
they still have the attributes that make graphene 
so appealing: they’re lightweight, strong, have 
a large surface area and conduct electricity and 
heat. It's already possible to buy dozens of differ- 
ent grades of graphene made this way as powders 
or in solution. Developing applications range 
from conductive inks and fillers in composites, to 
battery electrodes and electrochemical sensors. 
For more demanding electronics, however, 
researchers need much purer graphene in large 
sheets — millimetres or centimetres in diameter. 
The aim is to grow theoretically perfect graphene: 
a hexagonal, honeycomb carbon lattice with not 
a single atom out of place. In practice, a real car- 
bon lattice is strained by defects, polluted by 5- or 
7-membered rings, its edge is ragged, and it sits 
on a substrate that interferes with its properties. 
These imperfections impair conductivity and 
other electronic properties. Whether that matters 
in practice depends on the purpose: transparent 
conducting films in touch screens, for example, 
need not be as conductive as those in solar cells. 
Large carbon sheets and small imperfect 
carbon platelets are such contrasting forms of 
graphene that they could almost be considered 
different materials. In fact, each version has its 
merits, depending on cost and application. “We 
don’t all buy $5,000 suits. A lot of the time a $300 
suit is all you need to wear,” says James Tour, an 
organic chemist who specializes in nanotechnol- 
ogy at Rice University in Houston, Texas. But for 
materials scientists allured by graphene’s extraordinary 
potential in electronics and optics, it is the 
$5,000 suit, or high-quality graphene, that 
they strive to produce on a large scale. 


EXFOLIATING GRAPHITE 

For at least a century, researchers have been 
producing stacks of thin carbon platelets by 
pulling graphite apart into its constituent layers. 
(A description of a suspension of small graphite 
flakes was published in 1907, for example.) But 
these days manufacturers, alert to graphene’s 
potential, produce consistently thinner platelets. 
Graphite’s layers are very close-knit, so acids are 
used to oxidise the material to graphite oxide, lay- 
ers of which are more easily separated to form 
graphene oxide. Chemical reduction of graphene 
oxide leaves graphene platelets crumpled in a 
solution, or as a powder: a process first reported 
in 1961 by German chemist Hanns-Peter Boehm, 
who called his product ‘thinnest carbon films’. It 
is this method — with proprietary modifica- 
tions — that companies such as Vorbeck Mate- 
rials, in Jessup, Maryland; Angstron Materials 
in Dayton, Ohio; and XG Sciences in Lansing, 
Michigan, still use to produce large quantities of 
graphene. Researchers constantly invent twists 
in the technique: in 2011, Chao Gao at Zhejiang 
University in Hangzhou, China, reported spin- 
ning a concentrated solution of graphene oxide 
flakes into fibres several metres in length, which 
could then be chemically reduced to filaments of 
pure graphene’. 

Angstron scientists have found a way to pro- 
duce graphene without the harsh acid treatment 
of the graphite oxide route. They disperse pieces 
of graphite in water and surfactant, and use ultrasound 
to peel off layers of carbon: a more pristine 
graphene as fewer chemicals are involved in its 
production. Andrea Ferrari, professor of nano- 
technology at the University of Cambridge, UK, 
who has worked on solutions of graphene for 
ink-jet printing, reckons this method might be 
useful in electronics that do not need the highest 
quality graphene, such as touch screens, ‘smart 
windows’ that can adjust light levels when a voltage 
is applied, or conductive fabrics that might 
carry signals from iPods, for example, or heart 
monitors. 


GROWING GRAPHENE 

To make larger, higher-quality sheets of gra- 
phene, researchers are growing the material as 
a single-layer carbon film on top of another 
surface, a technique known as chemical vapour 
deposition. This isn’t new either: ‘monolayer 
graphite’ films, or graphene, were made this 
way in the 1970s. Typically, a mixture of methane 
and hydrogen gas is passed across a sheet 
of copper in a furnace at 800-1000°C, and a 
single carbon layer forms atop the copper; 
chemical processing can remove the copper 
and deposit the graphene on a more useful 
substrate material, such as silicon dioxide (see 
‘Growing Graphene Films’). Tour says a laboratory 
using a US$10,000 setup can make graphene 
sheets a few square centimetres in size. 
And while methane is the standard reactant, 
any carbon source will do: Tour has even put 
cockroach legs on top of copper and produced 
a thin graphene film. 

GROWING GRAPHENE FILMS 
Researchers make large (centimetre-scale) graphene films by depositing carbon atoms from a vapour onto a copper surface. Roll-to-roll 
processing then transfers the graphene film from copper to another substrate. 

[Diagram labels: graphene; copper foil] 

Copper foil is heated to 800-1000°C in a 
furnace, where hydrogen (H2) and methane 
gas (CH4) are piped through. Carbon atoms 
grow as graphene film on top of the copper. 

A more sophisticated version of this approach 
yields graphene on an impressive scale. In 2010, 
researchers at Sungkyunkwan University in 
Suwon, South Korea, made transparent graphene 
films up to 76 cm across by depositing carbon 
atoms onto copper and etching the copper away’. 
Electronics giant Samsung wants to use a reel-to-reel 
version of this technique to produce graphene 
film on a continuous roll, although Seungmin 
Cho, principal research engineer at subsidiary 
Samsung Techwin in Gyeongnam, South Korea, 
says for now the company is concentrating on 
making high-quality single sheets. 

Materials scientists are working on ways to lift 
the graphene off the copper plate and onto insulating 
substrates (which won’t interfere with 
electronic properties) without wrinkling the 
film or otherwise compromising its quality. Top 
of their list is silicon dioxide, the ubiquitous insulator 
used in silicon chips. In 2011, Tour showed 
that when nickel is placed atop silicon oxide, 
a layer of carbon film grows at each surface of 
the interface between the two; etching away the 
nickel leaves a bilayer of graphene coated directly 
on silicon oxide’. But the growth process takes 
place at 1000°C: temperatures high enough to 
wreck any dopants or pre-structured devices 
(such as transistors) in the underlying silicon. 
Tour is confident that graphene can be grown 
easily on insulating substrates. “I’m reviewing 
papers all the time that promise improvements,” 
he says. One paper this year, by Soon-Yong Kwon 
and colleagues at the Ulsan National Institute of 
Science and Technology in South Korea, promises 
a similar process that works at just above 
room temperature’. 

[Diagram labels: graphene on copper foil; polymer support; acidic bath; graphene on polymer support] 

Adhesive ‘thermal release’ tape 
is attached on top of the 
graphene, by applying pressure 
between heavy rollers. 

Even a graphene film grown on a surface is far 
from perfect. The film forms when micrometre- 
sized patches of hexagonally-arrayed carbon 
atoms — each cluster a tiny fragment of graphene 
— sprout simultaneously from various points on 
the substrate. The clusters never stitch together 
perfectly, so the final graphene sheet looks like a 
patchwork quilt of carbon, with kinks of 5- and 
7-membered rings where each patch joins with 
the next. The largest single perfect graphene 
patch ever created is reported by nanotechnolo- 
gist Rodney Ruoff at The University of Texas at 
Austin, together with scientists at Texas Instru- 
ments in Dallas, who grew individual clusters up 
to 0.5 millimetres on a side’. 


FROM SHEETS TO RIBBONS 

Graphene film conducts electrons so well, it 
is hard to block their flow — a problem if one 
wants to use graphene in a transistor to control 
electrical current. But thin ribbons of graphene 
can be semiconducting, because the narrow 
channels alter the energy states of electrons. 
The problem is in how to make such a rib- 
bon. Using lithography to slice up large sheets 
into graphene strips is, says Tour, “like using 
a chainsaw to do the finishing work in your 
home” — the ribbon’s edges produced by this 
crude approach lack the uniformity required 
for electronic control. 

One approach is to start with carbon nano- 
tubes, which are essentially rolled up graphene. 
Researchers have variously unzipped them into 
flat sheets using metal catalysts, or etched 10-20 
nm wide ribbons from them using ionized argon 
gas. But control of the edges remains a problem: 
only one particular geometry of the ribbon’s edge 
will produce semiconducting graphene. 

Physicist Walter de Heer at the Georgia 
Institute of Technology in Atlanta takes a different 
tack: growing graphene ribbons directly 
in place on a substrate. He heats silicon carbide 
(a compound of silicon and carbon that has 
been mass-produced for over a century) to 
1,600°C, and the silicon atoms on top escape, 
leaving carbon behind. To grow graphene ribbons 
exactly as required, de Heer notches a step 
into the silicon carbide; the graphene grows 
preferentially on the sidewall’. 

[Diagram labels: released polymer support; target substrate; graphene on target] 

The copper is eaten away in a 
plastic bath filled with etchant 
(such as ammonium persulfate, 
a bleaching agent and oxidant). 

On heating to 90-120°C, the 
adhesive tape unsticks, leaving 
the graphene clinging to a 
target substrate. 

The best graphene ribbons developed so far 
are those grown molecule by molecule. Chemist 
Klaus Müllen’s team at the Max Planck Institute 
for Polymer Research in Mainz, Germany, 
make flawless ribbons via chemical reactions 
between molecules based on hexagonal rings 
— each molecule chosen to produce an atomi- 
cally precise graphene lattice when they fuse 
in long chains’. Müllen says his team can now 
make perfect graphene ribbons. The catch: they 
are only a few hundred nanometres long. De 
Heer envisages using similar chemistry to grow 
thin semiconducting electronic interconnects 
between larger metallic areas of graphene — 
although such architectures and their applica- 
tions are decades from fruition. 

From the heights of molecular perfection, to 
the large, imperfect platelets on the graphene 
market: there’s space for all production methods, 
though they seem worlds apart. 


Richard Van Noorden is an assistant news 
editor at Nature. 

Coloured scanning electron micrograph of a transistor composed of a graphene wire (centre), gold electrodes (dark yellow) and silicon (blue). 


ELECTRONICS 


Back to analogue 


Trying to shoehorn graphene into a digital circuit isn’t working. But there may be another 
potential path to glory. 


BY KATHERINE BOURZAC 


In July 2008, Tomas Palacios was sitting 
in an airport in Boston, Massachusetts, 
waiting for a delayed flight and thinking 
about his latest batch of graphene transistors. 
Yet again, they weren't working as they 
should. Palacios was excited when he learned 
about graphene only four years before, and 
he hoped this form of carbon could hasten 
a future generation of computer chips once 
silicon was pushed to its physical limits. But a 
major hurdle blocks the way: there seems to 
be no good way to make graphene transistors 
that can perform the on-off digital logic that 
is the basis of computation. For the past few 
years, many researchers have been tinkering 
with graphene to make it behave more like a 
conventional material. Like Palacios, many 
had been disappointed. “It is very difficult to 
fight against nature,” he says. 

With hours to kill at the airport, Palacios 
started to mull over his results. Studying some 
of his latest current-versus-voltage graphs, 
he realized that the electrical behaviour of a 
single graphene transistor looks like that of a 
circuit comprising multiple silicon ones. He 
called one of his students and asked him to test 
a single graphene transistor against the silicon 
circuit. When he finally got back to his lab at 
the Massachusetts Institute of Technology in 
Cambridge, he saw that the graphene transis- 
tor outperformed the silicon grouping — and 
that his group had just created a new branch 
of research. 


NANOSPAGHETTI 

Ever since Andre Geim and Konstantin 
Novoselov of the University of Manchester, 
UK, published their seminal Science paper in 
2004 describing graphene’s electrical proper- 
ties, researchers have been trying to figure out 
what to do with the material. Some of this effort 
has been like trying to push a square peg into 
a round hole. “If you try to fit graphene into a 
silicon box, it’s a bit awkward,” says James Tour, 
a chemist at Rice University in Houston, Texas. 
Now, many people are testing graphene in tran- 
sistors to see what it does well, which might not 
be digital computing. They’re shifting their 
focus towards speedier flexible electronics, 
more compact and efficient circuits that oper- 
ate in unusual ways, and high-performance 
analogue electronics that could improve tel- 
ecommunications and mobile data streaming. 

The faster that electrons can move from 
one end of a transistor material to the other 
— that is, the higher its mobility — the faster 
that information can move around a computer 
chip. When Geim and Novoselov wired up 
flakes of graphene, they found that electrons 
sailed swiftly through them. Indeed, electrons 
in graphene behave as if they have no mass. 
Graphene’s mobility at room temperature can 
be as high as 100,000 square centimetres per 
volt per second — some 70 times higher than 
it is for silicon. 
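
The two figures quoted above can be turned into a quick back-of-envelope comparison. The sketch below is only illustrative: it takes the article's 100,000 cm²/(V·s) and the "70 times" ratio at face value, and it assumes the simple low-field relation v = μE (drift velocity equals mobility times electric field), which ignores velocity saturation and device geometry. The function names and the 10 V/cm example field are hypothetical choices made for this example.

```python
# Figures quoted in the article.
GRAPHENE_MOBILITY_CM2_PER_VS = 100_000.0  # upper room-temperature value
GRAPHENE_TO_SILICON_RATIO = 70.0          # "some 70 times higher" than silicon


def implied_silicon_mobility(graphene_mobility: float, ratio: float) -> float:
    """Silicon mobility implied by the article's two numbers, in cm^2/(V*s)."""
    return graphene_mobility / ratio


def drift_velocity_cm_per_s(mobility_cm2_per_vs: float, field_v_per_cm: float) -> float:
    """Low-field drift velocity v = mu * E (assumption: no velocity saturation)."""
    return mobility_cm2_per_vs * field_v_per_cm


if __name__ == "__main__":
    silicon_mobility = implied_silicon_mobility(
        GRAPHENE_MOBILITY_CM2_PER_VS, GRAPHENE_TO_SILICON_RATIO
    )
    field = 10.0  # V/cm, an arbitrary small field chosen for illustration
    v_graphene = drift_velocity_cm_per_s(GRAPHENE_MOBILITY_CM2_PER_VS, field)
    v_silicon = drift_velocity_cm_per_s(silicon_mobility, field)
    print(f"Implied silicon mobility: {silicon_mobility:.0f} cm^2/(V*s)")
    print(f"Graphene drift velocity at {field} V/cm: {v_graphene:.3g} cm/s")
    print(f"Silicon drift velocity at {field} V/cm: {v_silicon:.3g} cm/s")
```

Under these assumptions, the same applied field moves charge through graphene about 70 times faster than through silicon, which is the sense in which higher mobility lets information move faster around a chip.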

In addition to speed, graphene offers a poten- 
tial size advantage. Consisting of a single layer 
of carbon atoms, the material is effectively two- 
dimensional. This means it should be possible 
to make computer switches even smaller than 
the silicon transistors that form the backbone 
of the US$250-billion semiconductor market. 
“We would like to find a new switch, a mate- 
rial to replace silicon,” says Luigi Colombo, a 
researcher at Texas Instruments in Dallas. 

Graphene might also be easy to manufac- 
ture. Fengnian Xia, a researcher at the IBM 
Watson Research Center in Yorktown Heights, 
New York, says his lab can already grow sheets 
of the material on the silicon wafers that the 
semiconductor industry uses for large-scale 
chip manufacturing. Compatibility with exist- 
ing infrastructure is a big plus of graphene over 
other new materials. Carbon nanotubes, for 
example, have similar properties to graphene 
but fail the infrastructure test: they are difficult 
to align on a surface, tending to tangle up like 
spaghetti. 


MIND THE GAP 

The big problem with making digital switches 
from graphene is that the material lacks an elec- 
trical property called a bandgap — essential for 
digital logic. To understand this phenomenon, 
it's helpful to look at a conventional silicon tran- 
sistor, the type found in the hundreds of millions 
of today’s computer chips. These transistors have 
four main parts: a silicon channel and three 
metal terminals: gate, source and drain. An elec- 
tric voltage applied at the gate controls the flow 
of current from the source electrode through the 
silicon channel to the drain electrode. Silicon has 
certain energy states where it's conductive, and 
others where it’s not. The energy difference 
between those two states is called the bandgap. 
By applying particular voltages to the channel 
through the gate, it’s possible to switch silicon’s 
energy level between the conductive state, to 
turn the transistor on and make a 1, and the 
insulating state, to turn the transistor off and 
make a 0. Getting that zero is critical for effi- 
cient digital logic, and to keep power consump- 
tion in check. Because there are hundreds of 
millions of transistors in a microprocessor, it’s 
important to be able to turn them off when 
they're not in use, otherwise their power 
requirements would be unsustainable. But 
graphene does not naturally have a bandgap 
and can’t make a zero. Researchers have devel- 
oped ways to overcome this problem. Graphene 
has been doped with chemicals, punched with 
tiny holes, and exposed to all kinds of abuse to 
induce a bandgap. 

Unfortunately, such interventions also com- 
promise graphene’s charge mobility, so the 
resulting transistors are not fast enough to be 
worth the bother. “Nobody has made a switch 
out of graphene that is technologically feasible 
and comparable to silicon,” says Walter 
de Heer, a physicist at the Georgia Institute 
of Technology in Atlanta, who made 
some of the first graphene transistors. 
Novoselov agrees, but notes that people are 
still coming to grips with graphene. “We knew 
from the beginning that it would be challenging 
to make graphene work for logic applications,” 
he says. 

So researchers are now using graphene tran- 
sistors in a field that plays better to its strengths: 
analogue electronics. Analogue devices, the elec- 
tronics at work in communications equipment 
such as receivers, transmitters and amplifiers, 
don't speak in zeros and ones but operate over a 
range of voltages. Wherever an electronic device 
meets the outside world, there are analogue 
electronics. The chip board in a smartphone 
has many. Besides processing audio, it handles 
photographs taken with the camera, while 
other circuits stream video. So although much 
of what we think of as information processing 
now occurs in the digital realm, “analogue sets 
the speed limit on data”, says electrical engineer
Deji Akinwande, who studies carbon nanomate- 
rials at the University of Texas at Austin. 




SPEEDING COMMUNICATIONS 

Phaedon Avouris, a materials scientist at IBM, 
has been working on graphene transistors since 
2006. Working independently, his group and 
Philip Kim's at Columbia University in New 
York made the first graphene transistors with 
a bandgap by slicing the material into ribbons 
tens of atoms wide with the help of an elec- 
tron beam. The procedure is complicated, and 
the resulting transistors are slow. “It became 
clear that this will not go too far in a practical 
application,” he says. “If graphene is really the 
miracle material, use it as it is and don't try to
change it into something else.” 

For Avouris, using graphene properly means 
taking advantage of its high mobility in high- 
frequency analogue electronics and its ability to 
integrate well with other materials. Generally, 
the faster the analogue electronics can switch 
— the higher their frequency — the less noisy, 
or error-prone, they are. Higher-frequency 
electronics can produce stronger signals that 
can travel farther, and they use less energy 
than low-frequency electronics. And signals 
sent at higher frequencies have lower interfer- 
ence, which means more of them can be nestled 
together in a given band of the spectrum for 
faster data transmission. 

The analogue electronics in consumer 
goods are, like digital electronics, based on 
silicon. For niche applications, such as encod- 
ing military messages, and transmitting and 
receiving signals in high-power mobile-phone 
base stations, speedier electronics made from 
more expensive materials are required. These 
so-called ‘III-V’ materials (named after the
two groups of the periodic table that their elements
come from) include gallium arsenide and
indium phosphide. Some III-V transistors
operate at frequencies as high as 1 terahertz 
(1,000 gigahertz). But whereas III-V materials 
aren't compatible with silicon electronics and 
need to be placed on their own chips, graphene 
is compatible with silicon. 

So far, no graphene circuits have reached 
terahertz speeds. In 2010, Avouris made 
transistors with speeds of 100 GHz — about 
ten times as fast as the best silicon transistors. 
Avouris believes the only way forward is to use 
the equipment already on the floor of today’s 
semiconductor factories, so his group has 
been mostly using graphene films on the sur- 
face of the same wafers used to make computer 
chips. The IBM result has been overtaken by 
Xiangfeng Duan, a chemist at the University 
of California, Los Angeles, with a 300-GHz 
graphene transistor made with graphene 
flakes. But these flakes are not as manufactur- 
ing-friendly as the films of graphene that the 
IBM group used. Both groups achieved these 
speeds by taking good care not to damage the 
graphene — a tough challenge because other 
materials must be laid on the graphene in order 
to make a transistor. 

Electronics require integrated circuits rather 
than single transistors. During the summer of 
2011, Avouris’ team made the first graphene 
integrated circuit on a wafer, reaching a record 
speed of about 10 GHz. The circuit combined 
a graphene transistor with two inductors to 
make one of the workhorse devices of telecom- 
munications: a frequency mixer. Frequency 
mixers combine a signal to be transmitted 
— say, the sound of a voice as picked up by a
mobile-phone microphone — with a reference 
signal so that it can travel through the air. Fre- 
quency mixers also unscramble such signals 
at the other end. Similar efforts to develop
graphene-based analogue electronics are under
way at HRL Laboratories in Malibu, California,
and at several universities.

READY TO GO
One of the advantages of graphene electronics is that they can be manufactured using equipment and processes already used by the chip industry.
1. The 8-inch wafer is covered in graphene transistors.
2. The wafer can be cut into chips.
3. Microscope zooms in on a single transistor.
4. View of the gates (horizontal bars) that control the transistor.
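
The frequency mixing described above can be illustrated numerically. The following is a minimal sketch of an idealized mixer that simply multiplies a signal tone by a reference tone; it is not a model of the IBM graphene circuit. The frequencies, sample rate and function names are illustrative assumptions. The product of two tones contains energy at the sum and difference of their frequencies, which is how a low-frequency signal is shifted up to a band suitable for transmission.

```python
import math

def mix(signal_hz, reference_hz, sample_rate_hz, n_samples):
    """Ideal mixer: multiply a signal tone by a reference tone, sample by sample."""
    return [
        math.cos(2 * math.pi * signal_hz * k / sample_rate_hz)
        * math.cos(2 * math.pi * reference_hz * k / sample_rate_hz)
        for k in range(n_samples)
    ]

def tone_amplitude(samples, freq_hz, sample_rate_hz):
    """Amplitude of the component at freq_hz, found by correlating with cosine and sine."""
    n = len(samples)
    re = sum(s * math.cos(2 * math.pi * freq_hz * k / sample_rate_hz)
             for k, s in enumerate(samples))
    im = sum(s * math.sin(2 * math.pi * freq_hz * k / sample_rate_hz)
             for k, s in enumerate(samples))
    return 2 * math.hypot(re, im) / n

# Illustrative values only (assumptions, not taken from the article).
SAMPLE_RATE_HZ = 100_000
SIGNAL_HZ = 1_000       # stand-in for a voice tone
REFERENCE_HZ = 10_000   # stand-in for the reference signal

mixed = mix(SIGNAL_HZ, REFERENCE_HZ, SAMPLE_RATE_HZ, n_samples=10_000)

# The output has components at the difference and sum frequencies,
# and essentially nothing left at the two input frequencies.
for f in (SIGNAL_HZ, REFERENCE_HZ, REFERENCE_HZ - SIGNAL_HZ, REFERENCE_HZ + SIGNAL_HZ):
    print(f"{f:>6} Hz: amplitude {tone_amplitude(mixed, f, SAMPLE_RATE_HZ):.3f}")
```

Running the same multiplication again at the receiver, with the same reference, shifts the signal back down, which corresponds to the unscrambling step mentioned in the text.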

Many challenges remain. The IBM frequency 
mixers won’t be found in mobile phones any
time soon because the signals they produce 
are too weak, a problem that originates at the 
transistor level and that Avouris and others are 
trying to address. The hundreds of gigahertz 
speeds reported for graphene devices often 
ignore an important fact, says Columbia Uni- 
versity electrical engineer Ken Shepard. The 
faster these devices operate, the weaker the 
signal they generate. When the signal gets over- 
whelmed by noise, the circuit becomes useless. 
These effects, dubbed parasitics, mean that 
in reality, graphene transistors are not much 
faster than silicon ones. “For these transistors 
to be earth-changing they have to be at least 
an order of magnitude better than silicon and 
that’s just not there — they are a factor of two 
better,” Shepard says. Whether these problems
are fundamental or are engineering challenges 
isn't yet clear, he says. “As more designers look 
at this material, the technology issues will fall 
away, and we will get a better idea of what the 
intrinsic limitations are.” 


EMBRACING ECCENTRICITY 

Rather than try to engineer graphene for high- 
frequency devices that might one day supplant 
silicon-based electronics, some researchers are 
trying to beat new paths. Over the past cou- 
ple of years, says MIT’s Palacios, “people are 
focusing more and more on applications for 
graphene that silicon cannot do”. 

Graphene is stretchy, flexible, transparent 
and strong; silicon is none of these things. 
Because graphene has other attractive proper- 
ties, the material doesn't have to beat silicon to 
be useful for digital or analogue electronics, 
says the University of Texas’s Akinwande. He 
believes graphene is suited for applications such 
as ubiquitous computing or flexible, transpar- 
ent portable electronics that don’t break when 
dropped. Here the material is competing with 
very-low-electron-mobility materials such as 
semiconducting polymers. “The goal of flex- 
ible electronics is to make them waterproof, 
shockproof and environmentally friendly,” says
Akinwande. 

Akinwande is working on making gra- 
phene circuits on plastic. The challenges are 
in the manufacturing, he says. Compared to 
the smooth surface of a silicon wafer, plastic 
is bumpy and graphene is extraordinarily sen- 
sitive to tiny flaws. Akinwande is working to 
establish partnerships with companies, includ- 
ing 3M, a technology company based in St Paul, 
Minnesota, to get plastic that’s suitable for gra- 
phene transistors. So far, graphene-on-plastic 
transistors have an intrinsic speed of 5 GHz — 
significantly slower than Duan’s 300-GHz tran- 
sistors, but faster than the best organic polymer 
transistors, which share graphene’s flexibility.

“Graphene will go first into applications 
that only graphene can serve,” says Palacios.
He also sees promise for graphene in flexible 
electronics. And from that day in the airport in 
2008, he’s been working on turning his frustra- 
tion with graphene's lack of a bandgap into an 
advantage in highly efficient graphene devices: 
single transistors that do the work of many sili- 
con devices, and do it better. A single graphene 
device could do the work of “20 silicon transistors”,
Palacios says, which would “reduce power
consumption and save space”. 

Graphene transistors can multitask because 
as the voltage varies, they reach a minimum 
current and then switch back and forth between 
carrying negative and positive charges. Kartik 
Mohanram, an engineer at the University of 
Pittsburgh in Pennsylvania, likens such ambi- 
polarity to an eccentric water tap. It would be as 
if, when you turn off the cold tap, the flow tapers
off before releasing a gush of hot water. 

Most engineers avoid graphene’s ambipolar- 
ity, making sure these devices operate only in 
voltage ranges that conduct either positive or 
negative charges, but not both. By varying the 
voltage on a graphene transistor all along that 
scale from positive to negative, engineers are 
able to get a device that performs more func- 
tions than a conventional transistor. Palacios 
says that one of the ambipolar transistors he 
designed can convert a digital signal into a 
Bluetooth signal that can be sent between, for 
example, a mobile phone and a wireless head- 
set. The conversion requires an analogue device 
called a phase-shift-key modulator. When 
made in silicon, this kind of modulator requires 
an integrated circuit chip. A single ambipolar 
graphene transistor, Palacios has shown, can do 
the job all by itself. 
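
To make the role of a phase-shift-key modulator concrete, here is a minimal sketch of the simplest binary variant, in which each bit selects one of two carrier phases. It illustrates the function of such a modulator in general, not how Palacios's ambipolar transistor implements it; the carrier frequency, sample rate and function names are assumptions chosen for illustration.

```python
import math

def bpsk_modulate(bits, carrier_hz, sample_rate_hz, samples_per_bit):
    """Encode bits on a carrier: phase 0 for a 1, phase pi for a 0."""
    waveform = []
    for i, bit in enumerate(bits):
        phase = 0.0 if bit else math.pi
        for k in range(samples_per_bit):
            t = (i * samples_per_bit + k) / sample_rate_hz
            waveform.append(math.cos(2 * math.pi * carrier_hz * t + phase))
    return waveform

def bpsk_demodulate(waveform, carrier_hz, sample_rate_hz, samples_per_bit):
    """Recover bits by correlating each bit period with the unshifted carrier."""
    bits = []
    for start in range(0, len(waveform), samples_per_bit):
        corr = sum(
            waveform[start + k]
            * math.cos(2 * math.pi * carrier_hz * (start + k) / sample_rate_hz)
            for k in range(samples_per_bit)
        )
        bits.append(1 if corr > 0 else 0)
    return bits

# Illustrative parameters (assumptions): one full carrier cycle per bit.
message = [1, 0, 1, 1, 0, 0, 1]
wave = bpsk_modulate(message, carrier_hz=1_000, sample_rate_hz=8_000, samples_per_bit=8)
recovered = bpsk_demodulate(wave, carrier_hz=1_000, sample_rate_hz=8_000, samples_per_bit=8)
print(recovered == message)
```

The phase flip between a 1 and a 0 is simply a sign change of the carrier, which gives some intuition for why a device that can conduct with either positive or negative charge carriers might perform this job compactly.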

Mohanram admits that the idea that ambipo- 
larity can have practical uses is not yet widely 
accepted by electrical
engineers. “The big [...]
we can do this on a
large scale,” he says.
While circuit designers
can come up with
a new idea after a 
week's thought, it can 
take a year to make and test just a few devices. 
Engineers are still learning how to work with 
graphene, and new ideas come along faster than 
they can be tested. 

Ultimately, for graphene to make a big 
impact in electronics, the material will have 
to live up to its hype, and then some. As Texas 
Instruments’ Colombo puts it, silicon “is a very 
tough act to follow”. But patience could be the 
watchword. After all, it was only in 2004 that 
a large number of researchers began working 
with the material. “Maybe graphene will revolutionize
electronics,” says de Heer, the Georgia
Tech physicist. “But not tomorrow.”




Katherine Bourzac is a science journalist 
based in San Francisco, California. 
BIOELECTRONICS


The bionic material 


Graphene could make an ideal basis for a medical repair kit. 


BY CHARLES SCHMIDT 


More and more people are having their
body parts replaced with
prostheses interconnected to the
nervous system. Advances in graphene tech- 
nology might bring these artificial devices to 
their senses — in bionic eyes and ears. 

Graphene is impervious to the harsh ionic 
solutions found in the human body. Moreover, 
graphene’s ability to conduct electrical signals 
means it can interface with neurons and other 
cells that communicate by nerve impulse, or 
action potential. These features have made 
graphene a material of some promise in next- 
generation bionic technology. 

In November 2011, Jose Garrido, a nano- 
technologist at the Walter Schottky Institute 
in Munich, Germany, took a big bionic step 
when he showed that arrays of transistors 
made of graphene can detect action potentials 
in heart cells. 

Garrido’s work was a welcome break- 
through, says Philippe Bergonzo of CEA-LIST 
in Saclay, France; Bergonzo is coordinator of 
NeuroCare, a European project looking into 
using carbon-based implants in ears, eyes and 
the brain. Apart from its stability and favour- 
able electronic properties, graphene is also 
flexible, so it can be wrapped around delicate 
tissues. No other material shares all these fea- 
tures, Bergonzo says, adding that graphene 
opens up research opportunities in neural 
prosthetics. 

To make graphene transistors, Garrido uses 
a method called chemical vapour deposition 
(see ‘Beyond sticky tape’, page S32) and grows
a layer of cardiomyocytes directly on top of 
the array. 


HOW IT WORKS 
Like other electrogenic cells, cardiomyocytes 
use action potentials to pass electrical signals 
along from cell to cell. Each of those voltage 
spikes changes the local electrostatic environ- 
ment by inducing a flow of ions in the chan- 
nel in the electrolyte separating the cells and 
the transistors. Garrido’s transistors respond
to this ion current by altering their electrical 
resistance, explains James Hone, who special- 
izes in nanoscale devices at Columbia Univer- 
sity in New York. The fluctuation in resistance 
constitutes a detectable signal between cardio- 
myocytes. 

Unlike silicon transistors, those made of 
graphene can’t be switched off — its physi- 
cal properties don’t allow for that, Hone says
— and this means they're not suited to digital
applications where devices must be able to gen-
erate ones and zeroes (see ‘Back to analogue’,
page S34). What graphene transistors are good
at, according to Garrido, is biological sensing 
— the sort of task that eyes and ears perform. 
In such analogue applications, the ability to 
switch off is not critical, and graphene’s distinct
qualities come to the fore. Should silicon tran- 
sistors be used in the human body, they would 
need to be coated with metal oxide to boost 
their stability in solution, Garrido says. Those
layers trap ions that produce noisy interfer-
ence and thus degrade signal quality. Graphene
transistors, on the other hand, don’t require an
oxide coat so they generate less intrinsic noise,
which enables them to detect the faint signals
(generally below a few hundred microvolts) of
cell communication.

Cortical neurons being grown on graphene for use
as biological implants (nuclei are stained blue).

Graphene is not the only carbon-based mate- 
rial with bionic potential. Diamond nanocrys- 
tals show promise in retinal implants to treat 
blindness, says Bergonzo. But diamond is solid,
inflexible and a poor [...]

[...] expected budget of US$6.3 million — the optimal
neural device will both sense and stimulate
cell activity. “That's where we expect a breakthrough
from graphene,” Bergonzo says. “We’d
like to know how neural networks are behav- 
ing in real time so we can stimulate them more 
effectively. Otherwise, you risk giving too much 
stimulation, or not enough.” 

Serge Picaud, principal investigator with the 
Vision Institute in Paris, involved in the Neuro- 
Care project, adds that because graphene is so 
thin, it could improve the interface between ret- 
inal implants and eye tissues. And those closer 
connections, he says, could improve sight. 


REMAINING CHALLENGES 

Hone emphasizes that research into graphene- 
based bioelectronics is in its infancy. Scientists 
still face fundamental challenges in manufac- 
turing, he says. Chemical vapour deposition, in 
particular, doesn't generate perfect graphene, 
and this limits the material’s electronic performance
(see ‘Beyond sticky tape’, page S32).

Moreover, materials scientist John Rogers at 
the University of Illinois in Urbana-Champaign 
cautions that silicon is still a contender. Silicon 
can be fabricated into structures as thin as 10 
nm; while that doesn’t match the 1-nm dimen- 
sions possible with graphene, it might just work. 
What’s more, Rogers points to a deep scientific 
and engineering base for silicon in the semicon- 
ductor industry. Rogers says that researchers are 
finding new ways to encapsulate implanted sili- 
con devices so they don't harm tissue. Still, Rog- 
ers sees tremendous opportunities for graphene 
in bioelectronic sensing because of the carbon 
material's much lower electrical noise. He envis- 
ages a hybrid approach that takes advantage 
of the strengths of both silicon and graphene. 
“I don’t think you’d want to make your entire
electronic system out of graphene,” he says. “You
could use ultra-thin silicon for switching and 
processing hardware and then use graphene to 
establish your interface with tissue.” 

For his part, Garrido says he’s now working 
to stack graphene transistor arrays on flexible 
substrates, such as the biocompatible polymers 
parylene and polyimide, each mechanically 
and heat stable. Graphene is clearly an “enabling
material”, he says. “And it has properties
that could lead to a whole new generation of
neural devices.”


Charles Schmidt is a science writer based in 
Portland, Maine. 


1. Hess, L. H. et al. Adv. Mat. 23, 5045-5049 (2011). 
Touch screens with a graphene electrode could bring flexibility to mobile phones and other devices.


OPTOELECTRONICS 


Come into 
the light 


Transparency across the spectrum combined 
with electronic prowess makes graphene an ideal 
photonic material. 


BY NEIL SAVAGE 
Graphene’s first major appearances outside
laboratories may be at the intersection
of electronics and optics, in the
form of transparent electrodes. Such electrodes 
— which carry current but allow light to pass 
through — let touch screens operate, collect 
current from solar cells without blocking the 
light needed to produce that current, and inject 
current to power organic light-emitting diodes 
(OLEDs) while allowing most of the photons 
to escape. 

Much graphene research has focused on the 
material’s remarkable electronic abilities and 
the possibility of using it to build transistors. 
However, it competes there with a well-estab- 
lished technology — silicon. As James Tour, a 
chemist at Rice University in Houston, Texas, 
puts it: “Silicon has had a 50-year head start, 
trillions of dollars and millions of person years.” 
So some researchers are starting to explore pos- 
sibilities that combine graphene’s electronic 
properties with its light-handling characteris- 
tics, as well as other attractive properties such 
as strength and flexibility. Transparent elec- 
trodes today generally rely on indium tin oxide 
(ITO). But indium is rare, and its price is rising 
as demand for smartphones and tablets soars. 
And because ITO is brittle, it must be depos- 
ited on an unbending surface (usually glass), 
thus placing constraints on device design. With 
a thin film of graphene as a conductive mate- 
rial, manufacturers could build flexible organic 
solar cells; OLED panels could be wrapped 
around furniture, automobile frames or a wrist. 
Graphene electrodes could make possible light- 
weight smartphones that could be folded or 
rolled up and whose screen won’t crack when
dropped. At the Consumer Electronics Show 
in early 2011, Korean electronics firm Samsung 
displayed a flexible active-matrix OLED phone 
screen based on graphene, but the company has 
not announced any products incorporating this 
technology. 

Tour concedes that graphene’s performance 
characteristics are not as good as ITO’s, at least 
not yet. A thin film of ITO lets about 90% of 
the light pass through it, and has electrical 
resistance of 50 ohms. A single-layer sheet of 
pristine graphene allows more light through — 
97.7% — but has far higher resistance, between 
2,000 and 5,000 ohms. Adding more layers of
graphene lowers resistance but worsens
transparency; four layers provide transparency
roughly comparable to ITO’s but, at 350 ohms,
the stack still has seven times the resistance.
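
The trade-off quoted above can be checked with a few lines of arithmetic. This is a rough sketch that assumes each graphene layer transmits 97.7% of the light reaching it, independently of the others, and ignores reflection and interlayer effects; the function and constant names are illustrative.

```python
# Figures quoted in the text.
SINGLE_LAYER_TRANSMITTANCE = 0.977   # one pristine graphene layer
ITO_TRANSMITTANCE = 0.90             # thin film of ITO
ITO_RESISTANCE_OHMS = 50
FOUR_LAYER_RESISTANCE_OHMS = 350

def stack_transmittance(layers):
    """Fraction of light passing through a stack, assuming independent layers."""
    return SINGLE_LAYER_TRANSMITTANCE ** layers

for n in range(1, 5):
    print(f"{n} layer(s): {stack_transmittance(n):.1%} of light transmitted")

ratio = FOUR_LAYER_RESISTANCE_OHMS / ITO_RESISTANCE_OHMS
print(f"four-layer graphene vs ITO resistance: {ratio:.0f}x")
```

Under this simple model, four layers transmit about 91% of the light, close to ITO's roughly 90%, while the resistance remains seven times higher.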

But graphene may not need to be as good 
electrically as ITO to get into the marketplace 
— being cheaper may be enough. And in the 
meantime, researchers are working to enhance 
the carbon monolayer’s performance. Tour is 
building a touchscreen that contains a thin mesh 
of aluminium nanowires between the substrate 
and the graphene. Only 5 micrometres in diam- 
eter, the wires are too slender to block light but 
they make the device dramatically more con- 
ductive. A screen with 90% transparency now 
has a resistance of only 20 ohms. Researchers at
Samsung and at Sungkyunkwan University in 
Seoul have collaborated to obtain similar results 
— 90% transparency with 30 ohms resistance — 
by mixing in nitric acid. And these South Korean 
groups have shown they can produce sheets of 
the improved graphene that measure 30 inches 
diagonally, using a roll-to-roll process that would 
be convenient for manufacturing. 

Finding efficient production processes is part 
of the challenge of introducing graphene trans- 
parent electrodes. Tour has devised a method 
to create thin ribbons of graphene by treating 
carbon nanotubes with sulphuric acid, causing 
them to unzip into ribbons. He then attaches to 
the ribbons’ edges molecular groups that allow 
the material to be suspended in a solution and 
sprayed onto a surface. This approach could allow
transparent electrodes to be painted onto large- 
area solar cells or OLEDs, potentially providing 
an inexpensive alternative to silicon solar cells. 

Andrea Ferrari, a nanotechnologist at the 
University of Cambridge in the United King- 
dom, has made graphene-based inks that an 
inkjet printer can use to make transparent elec- 
trodes and circuits; graphene ink and polymers 
could be used to create thin-film solar cells, he 
suggests. These wouldn't convert as much light 
to electricity as a silicon solar cell, but because 
they're so lightweight and transparent they could 
be installed on the facades of buildings, captur- 
ing sunlight that would otherwise go to waste 
without inducing any structural strain or ruin- 
ing the aesthetics. A thin-film solar cell could be 
transparent enough to cover a window pane, yet 
still produce enough energy to make a dent in an
electricity bill. Filling niches where silicon isn't 
appropriate could be a small but still valuable 
role for graphene, Ferrari says. “Even though it 
works much worse than the state of the art, it has 
the advantage of being flexible and cheap.”


GAPS IN THE SPECTRUM 
Graphene’s most attractive optical property, 
according to Ferrari, is that it “works at absolutely
any wavelength whatsoever”. That is, the material
absorbs the same fraction of light — roughly
2.3% — all across the spectrum, from the ultraviolet
into the far infrared. This property arises
from the electronic structure of graphene: it lacks
a bandgap — a range of energy states where electrons
are prohibited from existing. Graphene’s
ability to absorb across the spectrum is impor- 
tant because the way light interacts with different 
materials — whether it’s absorbed or scattered or
passes through — varies depending on its wavelength.
One wavelength may be ideal for cutting
a certain material, another for detecting particular
organic molecules. But at some wavelengths,
there's no device to perform those tasks, at least
not one that isn’t prohibitively expensive. Much of
the electromagnetic spectrum, Ferrari notes, has
“never been exploited, because it’s hard to make
semiconductors that work at those wavelengths”.
One such untapped portion of the spectrum 
is between the far infrared and microwaves, or
frequencies between about 100 and 10,000 gigahertz
(0.1-10 terahertz). Because such terahertz
radiation penetrates a wide variety of materials 
but is non-ionizing, it has potential applications 
in security scans, environmental sensing, and 
medical imaging. Currently, however, sources 
and detectors that operate at those frequencies 
are few and complex; some terahertz detectors 
require cryogenic cooling, whereas a graphene 
detector could work at room temperature. “Some 
people refer to it as the last gap in the electromagnetic
wavelengths,” says physicist Feng Wang,
head of the Ultrafast Nano-Optics Group at the 
University of California, Berkeley. 
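
To place the terahertz gap described above on the wavelength scale, the band edges can be converted with the relation wavelength = c / frequency. This is a small sketch using the vacuum speed of light; the function names are illustrative.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # exact, by definition of the metre

def gigahertz_to_terahertz(ghz):
    """1 THz = 1,000 GHz."""
    return ghz / 1_000

def wavelength_m(frequency_hz):
    """Free-space wavelength for a given frequency."""
    return SPEED_OF_LIGHT_M_PER_S / frequency_hz

# Band edges quoted in the text: about 100 GHz to 10,000 GHz.
for ghz in (100, 10_000):
    thz = gigahertz_to_terahertz(ghz)
    lam_um = wavelength_m(ghz * 1e9) * 1e6
    print(f"{ghz:>6} GHz = {thz:g} THz -> wavelength of about {lam_um:,.0f} micrometres")
```

The band therefore runs from wavelengths of roughly 3 millimetres down to roughly 30 micrometres, between the microwave and far-infrared regions.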

Wang built a terahertz detector based on 
nanoribbons of graphene ranging in width 
between 1 and 4 micrometres. The detector is 
based on plasmon resonance: when light strikes 
a material whose electrons resonate at a frequency
that matches that of the light, the
result is oscillations in electron density called
plasmons, which can enhance the strength of 
a signal. (The colours in stained glass windows 
are so vibrant because of plasmon resonance 
between visible light and metal nanoparticles 
embedded in the glass.) In graphene, this plas- 
mon resonance happens at terahertz frequen- 
cies. Wang designs devices to detect particular 
frequencies by choosing the width of the ribbons, 
and then further tunes the device by applying an 
electrical field. Creating such a detector is the
beginning of developing a set of tools for the
terahertz range, he says.

A graphene-based touch screen in operation.

Graphene may be valuable not only as a 
detector of terahertz radiation, but also as a 
source, Wang says. There are already quantum 
cascade lasers that emit at frequencies of 10 to 
100 THz. The most useful frequencies, however, 
lie between about 500 GHz and 4 THz, and the 
only existing sources in that range are limited to 
very low output power. Wang points out that as 
researchers push to create faster transistors that 
operate at these speeds, they could in effect be 
creating an emission source in that range as well 
— just as conventional megahertz-speed tran- 
sistors produce radio waves. Pushing emitters to 
higher and higher frequencies opens up the pos- 
sibility of higher bandwidth communications. 
Although the terahertz range is the area that's
most sorely lacking emitters and detectors, gra- 
phene could fill in other gaps in the spectrum as 
well, because “you have a sort of universal detector”,
says Phaedon Avouris, head of the nanoscale
science and technology group at IBM’s Watson 
Research Center in Yorktown Heights, New York. 
Current technology does not offer fast photo- 
detectors at mid- to far-infrared wavelengths; 
Avouris’s team built an experimental graphene-
based detector for this region of the spectrum 
that operates at a speed of 40 GHz. And, Avouris 
contends, because of imperfect measurement 
tools, it's possible that the graphene device could 
actually operate at 500 GHz or higher. 

As for emitters, Ferrari says graphene might 
soon be used in ultrafast lasers. Such lasers pro- 
duce short, high-energy bursts of light, lasting 
only picoseconds (trillionths of a second) or 
femtoseconds (thousandths of a picosecond). 
Delivering a lot of energy in a short burst lets
the lasers cut very precisely without damag- 
ing nearby material or tissue; they’re used in 
applications from eye surgery to printed-cir- 
cuit-board processing. Most ultrafast lasers 
use a technique called mode-locking, in which 
a device known as a saturable absorber soaks 
up continuous laser emission and releases it 
in a series of pulses. The absorbers are usually
semiconductor mirrors. But such systems are 
complex to build, and they generally emit light 
at only a narrow range of wavelengths. Using 
graphene as the saturable absorber would not 
only allow a simpler design but also permit 
broader spectrum operation. 


SMALL AND FAST 

In addition to emitting and detecting light, pho- 
tonics systems often require an optical modu- 
lator, which encodes signals onto light beams. 
Optical telecommunications use a modulator 
built of lithium niobate, and that’s unlikely to 
be displaced for the majority of applications, 
says Wang. His group demonstrated a graphene 
modulator about 25 square micrometres in size 
— 4 million times smaller than a conventional 
modulator made from lithium niobate. Such a 
small device could be integrated on a computer 
chip for optical interconnections, replacing cop- 
per wires with light beams and clearing a bot- 
tleneck for processing speed. The modulators 
should also work up to 10 times faster than exist- 
ing devices and, unlike lithium niobate, could be 
used in the mid-infrared, giving system design- 
ers more flexibility. 
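
The size comparison above implies a footprint for the conventional device that is easy to work out. This sketch assumes that "4 million times smaller" refers to area, which the text does not state explicitly; the variable names are illustrative.

```python
GRAPHENE_MODULATOR_AREA_UM2 = 25   # about 25 square micrometres, from the text
SIZE_RATIO = 4_000_000             # "4 million times smaller" (assumed to mean area)
UM2_PER_MM2 = 1_000_000            # 1 mm = 1,000 micrometres, squared

conventional_area_um2 = GRAPHENE_MODULATOR_AREA_UM2 * SIZE_RATIO
conventional_area_mm2 = conventional_area_um2 / UM2_PER_MM2

print(f"implied lithium niobate modulator area: {conventional_area_mm2:.0f} square millimetres")
```

On that reading, the conventional modulator occupies about 100 square millimetres, or one square centimetre, which is far too large to place on a chip alongside transistors.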

It will take some time to figure out the impact 
on photonics of these emerging graphene tech- 
nologies, Ferrari says. “Graphene optoelectron- 
ics is only three years old,” he says. “We need to 
give it another two or three years to see where
we are going.” By that time, this toddler technology
might be starting to light the way forward in
ways not yet conceived.


Neil Savage is a science and technology 
journalist based in Lowell, Massachusetts. 
Q&A Tomas Palacios
Taking charge 


Nature Outlook talks to the first director of MIT’s Centre
for Graphene Devices and Systems, which was created in 
July 2011 to foster collaboration among academic, industrial 
and government groups studying this form of carbon. 
What is the appeal of graphene?

Graphene is an extreme material. There is 
nothing thinner or with better transport 
properties than this one-atom-thick layer of 
carbon atoms. These and other properties 
offer tremendous opportunities in electron- 
ics, materials science and chemical engineer- 
ing. In addition, the initial work on graphene 
has motivated research in many other layered 
materials, such as boron nitride and tungsten 
disulphide. All these materials offer the ulti- 
mate nanotechnology dream: atomic control at 
the macroscale. 


What’s the big deal about single layer? 
High-frequency transistors require the charge 
carriers to be as close as possible to the gate 
electrode that controls the flow of current. In 
conventional materials, like the silicon used in 
ordinary transistors, the electrons spread in the 
vertical direction by at least 5 to 10 nanometres. 
This reduces the gate electrode’s control of these 
electrons, which in turn limits how small and 
fast the transistors are. Confining the carriers 
to one layer of atoms preserves that control. In 
semi-transparent electrodes for flat panel dis- 
plays, graphene’s single-atom-thickness mini- 
mizes light absorption and makes for almost 
transparent layers. 


How well characterized are graphene’s 
properties? 

We understand its low-energy transport prop- 
erties quite well. However, we still need a lot 
more work to develop devices and systems that 
can benefit from these properties. One exam- 
ple is the very long mean free path of charge 
carriers in graphene. In many cases, charges 
can travel across graphene for more than one 
micrometre without bumping into anything. 
That is orders of magnitude farther than in 
other materials. This reduced scattering could 
enable a new generation of devices inspired by 
the old vacuum-tube electronics, where the 
electrons are controlled not only by electric 
fields but also by magnetic fields. Somewhat 
related is the possibility of measuring the quan- 
tum Hall effect at room temperature in this 
device — a phenomenon resulting from the 
interaction between an electric current and a
magnetic field. That could be useful when you 
need a precise measure of resistance or voltage 
— such as those needed in metrology or high- 
speed communications. 


How widespread are the potential 
applications of graphene in electronics? 

The first commercial application will probably 
be as a semi-transparent electrode in solar cells, 
light-emitting diodes (LEDs) or touch screens. 
The metal used in transparent conducting films 
is expensive; graphene could be much cheaper, 
and much more flexible and robust. Several 
companies, including South Korea's Samsung 
and Spain’s Graphenea Nanomaterials, either 
already have some initial products or plan to have 
prototypes and products on the market soon.

Another interesting application is in micro- 
processors. As transistors get smaller and 
smaller, the diameter of the wires connecting 
them also needs to shrink. With conventional 
metallic conductors, defects and grain bounda- 
ries in the structure of crystalline metals can 
cause electrical resistivity to increase quickly at 
scales smaller than 50 nm. Because graphene is a 
single crystal, there are no grain boundaries, and 
its resistivity should stay roughly constant with 
diameter. That makes it ideal for low-resistance 
interconnections in sub-20-nm transistors. 

Once some of these initial applications are 
successful, many others will follow. Look at 
optics, for example. Graphene absorbs light 
uniformly across the entire spectrum, from the 
infrared to the ultraviolet — that’s true of very 
few other materials. We are taking advantage 
of this to develop a new generation of night- 
vision devices, which will be cheaper and able 
to cover larger fields of view. We hope to build 
an advanced prototype in 2012. 


Those are improvements on known technology. 
What about things that would be possible only 
with graphene? 

The development of a room-temperature 
quantum Hall effect device could enable a new 
generation of electronic circuits based on the 
precise voltage and resistance references that 
it would enable. The high performance of 
graphene devices over very large areas would 
also allow low-cost electronics in novel forms; 
think of a coffee cup that displays today’s news 
headlines, windows that display the outside 
temperature, and clothing that acts as antennae 
to extend the range of phones. Researchers are 
also investigating the use of graphene devices to 
steer electron beams in a way similar to what we 
do today in optics with light. This could revo- 
lutionize optical communications and enable 
new approaches for quantum computation and 
reconfigurable computers. 


Where is graphene on the development 
continuum from research to manufacturing? 
Devices involving semi-transparent electronics 
are very close to the market. Other applications, 
such as the use of graphene in computation or 
in very-large-area electronics, will take quite a 
bit longer. This is, among other reasons, due 
to the highly interdisciplinary work required 
for these applications. Methods to grow high- 
quality, large-area graphene films need to be 
developed; the fabrication technologies have 
to be modified to prevent the degradation of 
this one-atom-thick material during device 
fabrication; and, finally, new ways to integrate 
graphene devices and systems with more con- 
ventional electronics need to be invented. 


What more futuristic applications do you 
imagine? 

We are working on high-frequency graphene 
electronics (on clothes, paper and other
items) to communicate with smartphones.
We are collaborating with industrial partners 
on graphene-based patches to monitor glucose 
levels in the blood of people with diabetes. Every 
object would just need a graphene-based sticker 
to connect to the Internet. In collaboration with 
the International Iberian Nanotechnology Labo- 
ratory in Braga, Portugal, we are developing a 
graphene sensor to detect E. coli in food. 
Another promising area is the use of 
graphene in supercapacitors. The key here is 
to increase the surface-to-volume ratio of the 
electrodes that form the capacitor. To do this, 
researchers have recently used forests of carbon
nanotubes, but only half of each nanotube’s
surface is exposed to the electrolyte; the other
half is shielded by the shape of the nanotube. With
graphene, the entire surface — both sides of 
the layer of carbon — is available. This greater 
exposure enables higher energy storage. 
Something similar happens with conductive 
fibres. Carbon nanotubes are now used to make 
polymer fibres conductive, but the nanotubes 
only touch at a single point. When two graphene
flakes touch, on the other hand, they do so over 
a much larger area. This more intimate contact 
increases the electrical and thermal conductivity. 


Does any particular area of electronics stand to
benefit from graphene?

Terahertz electronics presents a good oppor- 
tunity. Most of the digital electronics in our 
computers work at a frequency of a few giga- 
hertz (GHz). The analogue electronics used 
for wireless communications operates in the 
2-40 GHz range, and a few radar and imaging 
systems work at 100-200 GHz. However, we 
have limited options today if we want electron- 
ics operating at frequencies in the 300 GHz to 3 
terahertz (THz) range.
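
As a rough illustration of the ranges quoted in this answer, the short Python sketch below sorts a frequency into the bands the interview names. The band edges are taken from the text; the names `BANDS` and `band_for` are hypothetical, chosen only for this example, and the 'few gigahertz' of digital electronics is left out because the text gives no limits for it.

```python
GHZ = 1e9
THZ = 1e12

# Band edges quoted in the interview, in hertz (treated as inclusive).
BANDS = [
    ("wireless communications", 2 * GHZ, 40 * GHZ),
    ("radar and imaging", 100 * GHZ, 200 * GHZ),
    ("terahertz gap", 300 * GHZ, 3 * THZ),
]


def band_for(freq_hz):
    """Return the name of the quoted band that contains freq_hz, or None."""
    for name, low, high in BANDS:
        if low <= freq_hz <= high:
            return name
    return None


if __name__ == "__main__":
    for f in (2.4 * GHZ, 150 * GHZ, 1 * THZ, 250 * GHZ):
        print(f"{f / GHZ:g} GHz -> {band_for(f)}")
```

A frequency such as 250 GHz falls between the quoted ranges, so the sketch returns `None` for it rather than guessing a band.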


What does graphene offer to terahertz 
electronics? 

Graphene’s high charge mobility, in combi- 
nation with its one-atom thickness, makes it 
attractive for filling the ‘terahertz gap’. Devices
operating at this range would work in advanced 
imaging systems, secure communications and 
chemical sensors based on terahertz spectros- 
copy. We need to find, however, a solution to 
graphene’s lack of bandgap — a range of ener- 
gies that the electrons are unable to occupy. 


What specific areas of technology will the 
centre focus on? 

We will soon see a pipeline of graphene appli- 
cations beginning to emerge, and our centre 
will be an integral part of that. For that, we 
pursue a multidisciplinary approach where 
we are bringing together physicists, materials 
scientists, and chemical and electronic engi- 
neers to help move graphene from being a 
really amazing material to being an engineer- 
ing reality that opens new industries. That 
requires collaboration between academia, 
industry and government. 


What kind of support is the centre receiving
from industry? 

If we want graphene to be successful beyond 
basic science it is important that industry adopts 
it and develops products with it. In the centre, 
we have industrial members who provide 
highly complementary expertise and are helping
to create a ‘graphene ecosystem’. The Santa
Clara, California, company Applied Materials, 
for example, a leader in processing equipment 
for the semiconductor industry, is excited about 
adapting its technologies to graphene. Another 
one of our member companies, STMicroelec- 
tronics, is working with us in the development 
of the next generation of graphene chemical and 
biological sensors, while the Dutch company 
DSM is contributing its expertise in chemistry 
and advanced materials. 


Where does the government enter the centre’s 
planning? 

Agencies such as the Office of Naval Research 
and NASA have been supporting graphene 
research for many years through several pro- 
grammes. We are currently working with them 
to develop detailed roadmaps for device and 
system insertion. 


Does your centre plan to study and develop 
any two-dimensional materials other than 
graphene? 

Graphene is an amazing material but it isn’t nec- 
essarily the best material for every application. 
We have extensive work on 2D materials such 
as boron nitride or tungsten disulphide, materi- 
als that could be very interesting in high-power, 
high-temperature applications, among others. 


Every technology faces hurdles in going from 
lab to market. What particular challenges does 
graphene face? 

We face several. First, the commercialization of 
graphene at affordable prices requires the devel- 
opment of mass production techniques. Second, 
many applications, most notably digital elec- 
tronics, require a material that has a bandgap — 
a demand that would seem to rule graphene out. 
Finally, we need to create an ecosystem for gra- 
phene in which companies interested in devel- 
oping growth and processing equipment talk to 
companies interested in devices and materials. 


What do you hope the centre will have 
achieved in its first five years? 

Our ultimate goal is to transform graphene 
from a wonder material to an engineering 
reality. By 2016, I hope we will have generated 
commercial applications in electronics, chem- 
istry and materials that take advantage of gra- 
phene’s unique properties. I am certain that it 
will transform many fields. At the MIT Centre 
for Graphene Devices and Systems we are trying
to speed up this process.


Interview by Peter Gwynne, a freelance science 
writer based in Sandwich, Massachusetts. 
PERSPECTIVE


Much of graphene’s appeal comes from its extraordinary
electronic properties that might one day make super-high-speed
devices a reality, but graphene has more to offer than
just its high electron mobility. It has other attributes deserving of 
attention — in particular, taking advantage of its chemistry to make 
new materials. 

We recently made an activated microwave-expanded graphite 
oxide (a-MEGO) by activating a graphene-like precursor with potassium
hydroxide¹. This porous material is composed of atom-thick
walls of trivalently bonded carbon and pores 0.6-5.0 nanometres 
in diameter. It has a remarkably high surface area — 3,100 metres 
squared per gram — some 17% higher than 
anything possible with graphene itself. 
And it came as a complete surprise to us; 
its carbon lattice includes heptagons and 
octagons that form regions of so-called 
negative curvature — or the familiar sad- 
dle shape. We expected to get separate per- 
forated thin sheets with many atoms along 
the sheet edges. 

Saddle-shaped ‘carbon nanofoams’ have
never been made in bulk and have mostly 
remained a theory; if they could be made in 
a continuous sheet form tens of microme- 
tres thick, they would likely be the ultimate 
electrode material for very high energy 
density and ultrafast supercapacitors (thus 
having extremely high power density as 
well), which store electric charge. Until 
recently, supercapacitors have had much 
lower stored charge per unit weight or volume
than batteries. But carbon a-MEGO
gives supercapacitors energy densities 
comparable to that of conventional lead- 
acid batteries — but they are able to deliver 
electricity much faster. Supercapacitors 
with such high-performing carbons could 
find use in the electric grid, consumer 
electronics and power tools, and could replace the
lead-acid batteries in cars, with a much
longer cycle life, into the hundreds of thousands of cycles (thus surpassing the
lifetime of most cars).

Treating graphene as a reactant rather than a product opens up 
other possibilities for novel and unexpected things. Graphene could 
be transformed into diamond without the conditions of the Earth’s 
core. For example, stacks of graphene layers could be converted to 
a product that has a mixture of trivalently and tetravalently bonded 
carbon atoms — or perhaps only tetravalently bonded, like diamond. 
Such a material might have a tensile strength far exceeding the 5.6 
gigapascals achieved by the carbon fibres that hold the world record in 
strength. There is rapid progress towards making metre-size sheets of 
graphene; on the research front, there is intense interest in improving 
reel-to-reel production to make, say, kilometre lengths of graphene 
Particles of crumpled graphene sheets created by
capillary compression in evaporating aerosol droplets⁵.


A means to an end 


Exploring graphene’s chemical properties reveals a world of potential 
away from the purely two-dimensional, says Rodney Ruoff. 


foils. It is time to consider clever ways to convert such graphene-based 
foils to nanometres-thick diamond foils. This can probably be done 
using low-energy chemistry, either by building up from monolayer 
graphene or by stitching adjacent layers together². This route might
provide a form of diamond that would be ultrastrong over lengths of 
metres or longer, while keeping diamond's alluring properties intact. 
Doping very large-area diamond foils with nitrogen or other elements 
could render them electrically conductive for nanoelectronics, a product
that might use nitrogen-doped graphene as the precursor. 

Another interesting phenomenon with graphene is how its physi- 
cal deformation affects its chemical activity. When crumpled, the 
carbon atoms along a fold are more reactive³.
One can, at least in principle,
tune the reaction barrier and reaction 
energetics by manipulating physical form. 
Very well-defined ripples or edges with 
curves of a few nanometres and spaced 
apart by, say, tens of nanometres would 
activate select rows of carbon atoms to 
achieve chemical patterns. That type of 
control over reactive sites could be impor- 
tant in nanoelectronics and nanoelectrome- 
chanical systems. 

Depositing graphene on substrates pat- 
terned by nanoimprint lithography to cre- 
ate rows of highly curved regions would 
thereby allow a particular chemical function 
in certain regions. One could, for example, 
create insulating zones that border highly 
conductive pristine graphene zones to pro- 
duce graphene nanoribbons. Alternatively, 
highly curved regions could be patterned by 
deposition onto arrays of tiny dimples, and 
then etched away to render an array of very 
small holes to make a nanomesh for filters 
and other applications. 

Graphene platelets also deform naturally. 
Noteworthy forms of crumpled graphene are 
the reduced graphene oxide (rG-O) sheets 
embedded in a polymer⁴ and those more recently
made⁵ by drying aerosol droplets. If this crumpling leads to localized
highly reactive regions, we might find that its durability is influenced 
by certain chemical environments — and also that it would work well 
for certain applications when mangled.


Rodney Ruoff is professor of mechanical engineering and materials 
science and engineering at the University of Texas in Austin. 
email: r.ruoff@mail.utexas.edu 


1. Zhu, Y. et al. Science 332, 1537 (2011).
2. Dikin, D. A. et al. Nature 448, 457-460 (2007).
3. Srivastava, D. et al. J. Phys. Chem. B 103, 4330-4337 (1999).
4. Stankovich, S. et al. Nature 442, 282-286 (2006).
5. Luo, J. et al. ACS Nano 5, 8943-8949 (2011).


© 2012 Macmillan Publishers Limited. All rights reserved 


JIAXING HUANG 


ASTRID & HANNS-FRIEDER MICHLER/SCIENCE PHOTO LIBRARY 


MATERIAL HISTORY 


Learning from silicon 


Silicon is more than an incumbent technology competing with graphene; it also has a
history researchers should remember. 


BY MICHAEL SEGAL 


Every new technology faces an incumbent,
and in electronics it is usually based on
silicon. Because of its intrinsic properties,
its abundance and the ease with which it
can be processed, silicon has dominated the
semiconductor industry. It would take remarkable
optimism to believe that a new material
could supersede it.

Yet graphene has inspired such optimism.
Its unique electrical characteristics, thinness
and processability have prompted speculation
that it could be the next silicon (see ‘Back to
analogue’, page S34), a tall order indeed. But
whether or not graphene succeeds, the comparison
to silicon is relevant in another sense:
silicon’s history gives insight into what to expect
from graphene.

First, silicon teaches us that it’s hard to predict
the strongest or most valuable use for a
novel material. Although most people associate
silicon with electronic and optical devices,
that was not where the material first made its
mark. Back in the 1890s, silicon was a ‘poor
man’s alloying agent’ used in steel and aluminium
metallurgy, competing with nickel,
chromium and manganese. In the 1930s, US 
chemical giant DuPont intensively investigated 
silicon as an alternative to lead-based pigments 
in white paint. 

These early uses were unrelated to the device 
in which silicon was to have its most profound 
impact — the transistor — but were essential 
for that technology’s eventual development. 
That's because each application led to methods 
for producing large volumes of highly pure sil- 
icon. Research into silicon electronic devices 
began in the late nineteenth century with the 
discovery that electrical current between elec- 
trodes attached to silicon and other semicon- 
ductors flowed more easily in one direction 
than the other. This led to the development of 
semiconductor rectifiers, devices that convert 
alternating current into direct current, which 
were used for communication and radar 
in both world wars. However, because only
impure and granular semiconductors were 
available, the rectifiers were unreliable and hard 
to manufacture. Bell Labs and other organiza- 
tions applied or adapted the production meth- 
ods developed to purify silicon for metallurgy 
and paint to crystal rectification. This conver- 
gence was instrumental in the development of 
the first germanium transistor in 1947, and the 
first silicon transistor in 1954. 

Something similar is going on now with 
graphene. The Nobel-prizewinning work on 
graphene in 2004 included an electronic appli- 
cation — a transistor — and electrical devices 
were an early focus of research. Since then,
however, an increased understanding of graphene’s
mechanical, thermal and optical properties
has broadened the scope of potential applications to
include conductive inks and composites,
transparent or high-surface-area electrodes, 
impermeable membranes, nonlinear optics, 
and nanoscale pores. 

Some of these applications have spurred 
improvements in the way graphene is manu- 
factured, as was the case with silicon and met- 
allurgy. For example, the most cited graphene 
publication in the past two years describes not 
a device but rather a way to make large pieces of 
graphene for large-area transparent electrodes.
But many of the papers that cite this work apply 
the technique to devices in which graphene is 
an active electronic component. Similarly, one of 
the first mass-market applications of graphene is 
likely to be a conductive ink whose performance 
depends on a unique processing approach. 

As with silicon, then, early graphene appli- 
cations do not rely on its more advanced elec- 
trical characteristics, and are closely tied to 
steps forward in material fabrication and pro- 
cessing that are being adopted by the broader 
community. And, as with silicon, these early 
applications may turn out not to be graphene’s 
greatest hit. 


SWITCH HITTER 
Silicon has achieved its biggest success as an 
electronic switch. More than a million trillion 
transistors are made each year. But the story
of the switch goes beyond silicon: the develop- 
ment of an electronic switch capable of ampli- 
fying a signal precedes the silicon transistor by 
about 50 years. The vacuum tube triode was 
developed around 1906 and by the 1940s was 
found in radios, TVs and computers. And 
whatever technology brings, the switch should 
remain a central feature: digital information is 
created and processed with switches, after all, 
and even our brains rely on a sort of switch. 
Therefore, even with all of graphene’s poten- 
tial applications, an electronic graphene switch 
has special status. And, at least on first blush, it 
appeared that the next great switch might run 
on graphene. Much of the initial optimism was 
based on graphene’s remarkable charge mobil- 
ity, which is about a thousand times higher 
than that of silicon. Although this mobility 
has led to high-performance radio-frequency 
devices, it may not be particularly useful for
digital logic. Moreover, graphene is missing a
key characteristic required for building digi- 
tal logic using traditional transistor designs: a 
bandgap (a range of energies in which electrons 
cannot exist). Already, other 2D crystals such 
as MoS₂, which does have a bandgap, are nipping
at graphene’s heels. The situation recalls
the competition between silicon and materials 
such as germanium, which has higher mobili- 
ties, and gallium arsenide, which is better suited 
to optical applications such as lasers. Despite 
these disadvantages, silicon dominated because 
it is easier to process, has a better oxide, and is 
more abundant. Graphene cannot depend on a
single characteristic — electron mobility — for 
it to succeed as a material. 


Like silicon, however, graphene brings with 
it a host of new physics that may change our 
understanding of the relevance of mobility and 
bandgap for transistors. Scientists are exploring 
alternative device architectures that exploit this 
new physics, including tunnelling transistors
and transistors that guide and focus electrons 
like light. 

And switches may turn out not to be an important
part of graphene’s story after all. Kroemer’s
Lemma of New Technology occasionally comes 
up in discussion of graphene. This principle, 
named after Nobel laureate Herbert Kroemer, 
states that the primary application of a new 
technology is one created by that technology. 
If it holds here, graphene’s killer app may be 
something that nobody has yet imagined. 


GRAPHENE INKS SOME PROFITS 

Although the future may be murky, graphene 
is expected to make its consumer product 
debut in 2012. MeadWestvaco, a Fortune 500
packaging manufacturer based in Richmond, 
Virginia, is using graphene-based conductive 
inks to make a new kind of package with an 
integrated anti-theft
security system. The technology enables ‘open merchandising’,
in which products are readily available for
consumers to hold or see up close. Open
merchandising has traditionally been
associated with heightened security risks,
particularly for high-value items packaged in 
paperboard or plastic. 

MeadWestvaco’s solution is to use graphene-based
inks to monitor whether the package has
been tampered with. The inks form a conduc- 
tive trace that winds around the inside of the 
package and sticks to the product itself. A plastic 
tab attached to the top of the package contains 
powered circuitry connected to the trace; if the 
packaging is cut, or the tab or product removed, 
an alarm built into the tab sounds. Carrying the 
package out of the store also triggers the alarm. 
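
The alarm behaviour described above can be summarized as a simple condition. The Python sketch below is only an illustration of that logic, not MeadWestvaco’s actual circuitry: the class `PackageState`, its field names and the function `alarm_should_sound` are hypothetical, and the article does not say how the package detects that it has left the store, so that is modelled here as a plain flag.

```python
from dataclasses import dataclass


@dataclass
class PackageState:
    trace_intact: bool = True      # conductive ink trace is unbroken (packaging not cut)
    tab_attached: bool = True      # plastic tab holding the circuitry is in place
    product_attached: bool = True  # product is still stuck to the trace
    inside_store: bool = True      # assumption: exit detection reduced to a flag


def alarm_should_sound(state: PackageState) -> bool:
    """Return True if any of the tamper or exit events in the text has occurred."""
    return not (
        state.trace_intact
        and state.tab_attached
        and state.product_attached
        and state.inside_store
    )


if __name__ == "__main__":
    print(alarm_should_sound(PackageState()))                    # untouched package
    print(alarm_should_sound(PackageState(trace_intact=False)))  # packaging cut
```

Modelling each event as an independent flag keeps the sketch close to the prose: a single failed check is enough to trigger the alarm.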

Previous attempts at secure packaging like 
this had foundered because the right conduc- 
tive ink couldn't be found. Polymer and tradi- 
tional carbon-based inks are not sufficiently 
conductive; silver inks are expensive and harder 
to process. The graphene inks (supplied by 
Vorbeck Materials of Jessup, Maryland) work 
well because they contain mostly single-layer 
graphene sheets that are deliberately wrinkled,
preventing them from re-stacking into a
graphite-like material. Instead, the sheets cre- 
ate a network of pathways for current to flow, 
resulting in films that are highly conductive 
even at thicknesses of a few hundred nanome- 
tres. Such thin, flexible layers can be printed 
using newspaper-style roll-to-roll presses that 
are capable of high-volume production. David 
W. Miller, global director of security packaging 
systems and supply chain at MeadWestvaco,
says that major retailers have expressed an 
interest in deploying the packaging — and 
that one of them has already started testing it. 
Just a couple of years after the Nobel Prize was 
awarded, graphene is already on store shelves. 

As was the case with metallurgical silicon in 
the nineteenth century, materials processing 
competence is allowing graphene to compete 
with established materials on this first foray 
into the mass market. Volumes are expected 
to be high, stimulating further improvements 
in manufacture. Vorbeck has indicated that 
almost all of the revenue it earns from its inks 
will be reinvested into research and develop- 
ment efforts to broaden the ink’s appeal by, for 
example, increasing its conductivity to compete 
with silver inks used in solar cells. And Vor- 
beck’s graphene inks leave the material's more 
exotic properties (Dirac fermions, valley polari- 
zation and so on) for future products to exploit. 


ARE WE THERE YET? 

So graphene is in the process of leaving the 
laboratory. But will we have to wait as long as 
we did with silicon — about 130 years from 
first isolation to the transistor — for graphene’s 
golden age to arrive? Even if we must, we are 
already decades along the graphene develop- 
ment timeline. Thin graphite was being studied 
experimentally in the 1960s, and calculations 
on graphene were being performed earlier still. 
Much of the physics, chemistry and engineer- 
ing of graphene are similar to that of nanotubes 
and fullerenes. In fact, many of the physicists 
who now study graphene have moved from 
research programmes on nanotubes — some- 
thing reflected in the drop in growth of nano- 
tube research papers just as the volume of 
graphene papers shot up after 2004. 
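
The ‘about 130 years’ figure can be checked with simple arithmetic. In the Python sketch below, the transistor and graphene dates come from this article; the year silicon was first isolated is not given here, so 1824 is used as an assumption (a commonly cited date), and all variable names are illustrative.

```python
# Dates from the article unless marked as an assumption.
SILICON_ISOLATED = 1824    # assumption: commonly cited date, not stated in the article
SILICON_TRANSISTOR = 1954  # first silicon transistor
GRAPHENE_WORK = 2004       # Nobel-prizewinning graphene experiments
ARTICLE_YEAR = 2012        # year this piece was published


def years_between(start, end):
    """Number of years from start to end."""
    return end - start


silicon_wait = years_between(SILICON_ISOLATED, SILICON_TRANSISTOR)
graphene_elapsed = years_between(GRAPHENE_WORK, ARTICLE_YEAR)

print(f"Silicon: {silicon_wait} years from isolation to transistor")
print(f"Graphene: {graphene_elapsed} years since the 2004 work")
```

Under these assumptions the silicon interval comes to 130 years, whereas only 8 years separate the 2004 graphene work from this article; the studies of thin graphite in the 1960s are what put graphene ‘decades along’ its timeline.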

At the same time, we are in the early or mid- 
dle stages of graphene synthesis and processing. 
Key techniques, such as large-area chemical 
vapour deposition, have been developed in 
the past four or so years and continue to evolve 
(see ‘Beyond sticky tape’, page S32). And the
number of potential applications continues to 
increase. Silicon’s history teaches us to prior- 
itize processing, allow for surprises, be patient, 
and keep a guarded (and resilient) optimism. 
Things are, after all, off to a good start. 


Michael Segal is a senior editor of Nature 
Nanotechnology. 